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International application No. 
PCT/EP00/02701 



International filing date (day/month/year) 
28 March 2000 (28.03.00) 



Applicant 

BASF PLANT SCIENCE GMBH et al 



The applicant is hereby notified of the following in respect of the priority claim(s) made in the international application. 

1 . [3 Correction of priority claim. In accordance with the applicant's notice received on: 22 June 2000 (22.06.00), 
the following priority claim has been corrected to read as follows: 

US 07 February 2000 (07.02.00) 60/180,687 
even though the indication of the number of the earlier application is missing. 

□ even though the following indication in the priority claim is not the same as the corresponding indication appearing 
in the priority document: 

1>. PI Addition of priority claim. In accordance with the applicant's notice received on: , 
the following priority claim has been added: 

□ even though the indication of the number of the earlier application is missing. 

□ even though the following indication in the priority claim is not the same as the corresponding indication appearing 
in the priority document: 

3. □ As a result of the correction and/or addition of (a) priority claim(s) under items 1 and/or 2, the (earliest) priority date is: 

4. r~| Priority claim considered not to have been made. 

□ The applicant failed to respond to the Invitation gnder Rule 26bis.2(a) (Form PCT/IB/31 6) withi n the prescribed t.me limit 

□ The applicant's notice was received after the expiration of the prescribed time limit under Rule 26bis.1(a). 

□ The applicant's notice failed to correct the priority claim so as to comply with the requirements of Rule 4.10. 

The aDDlicant may, before the technical preparations for international publication have been completed and ! subject t Q the 
^^^^Xt^XhB international Bureau to publish, together with the Internationa M application, information 
concerning i the priority claim. See Rule 26bis.2(c) and the PCT Applicant's Guide, Volume I, Annex B2(IB). 

5. m In case where multiple priorities have been claimed, the above item(s) relate to the following priority claim(s): 

US 07 February 2000 (07.02.00) 60/180,687 



6. A copy of this notification has been sent to the receiving Office and 
Gil to the International Searching Authority (where the international search report has not yet been issued). 
[Tg the designated Offices (which have already been notified of the receipt of the record copy). 



The International Bureau of WIPO 
34, chemin des Colombettes 
121 1 Geneva 20, Switzerland 



Facsimile No. (41-22) 740.14.35 



Authorized officer 

Aino Metcalfe 

Telephone No. (41-22)338.83.38 
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003408873 



PCT/EP00/O2701 



ATENT COOPERATION TFOTY 

From the INTERNATIONAL BUREAU 



PPT 

NOTIFirATIDN OF Fl FCTION 

(PCT Rule 61.2) 
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Commissioner 
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United States Patent and Trademark 
Office, PCT 

zuil ooutn Llark Place noom 
CP2/5C24 

Arlington, VA 22202 
ETATS-UNIS D'AMERIQUE 

in its capacity as elected Office 


Date of mailing (day/mo nth/year) 
21 February 2001 (21.02.01) 




International application No. 
PCT/EPOO/02701 


Applicant's or agent's file reference 
3377/99 


International filing date (day/month/year) 
28 March 2000 (28.03.00) 


Priority date (day/month/year) 
01 April 1999 (01.04.99) 


Applicant 

DAHLQVIST, Anders etal 



1. The designated Office is hereby notified of its election made: 

| X| in the demand filed with the International Preliminary Examining Authority on: 

11 October 2000 (11.10.00) 



| [ in a notice effecting later election filed with the International Bureau on: 



2. The election | X | was 

| | was not 

made before the expiration of 1 9 months from the priority date or, where Rule 32 applies, within the time limit under 
Rule 32.2(b). 



The International Bureau of WIPO 
34, chemin des Colombettes 
121 1 Geneva 20, Switzerland 



Facsimile No.: (41-22) 740.14.35 



Authorized officer 

C. Cupello 

Telephone No.: (41-22) 338.83.38 



Form PCT/IB/331 (July 1992) 
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INTERNATIONAL PRELIMINARY EXAMINATION REPORT 

(PCT Article 36 and Rule 70) ^ 



Applicant's or agent's file reference 


See Notification of Transmittal of International 


3377/99 PCT 


FOR FURTHER ACTION Preliminary Examination Report (Form PCT/IPEA/416) 


International application No. 


International filing date (day/month/year) 


Priority date (day/month/year) 


PCT/EP00/02701 


28/03/2000 


01/04/1999 


International Patent Classification (IPC) or national classification and IPC 




C12N 15/54 






Applicant 






BASF PLANT SCIENCE GmbH et al. 





1. This international preliminary examination report has been prepared by this International Preliminary Examining Authority 
and is transmitted to the applicant according to Article 36. 

2. This REPORT consists of a total of 12 sheets, including this cover sheet. 

□ This report is also accompanied by ANNEXES, i.e. sheets of the description, claims and/or drawings which have 
been amended and are the basis for this report and/or sheets containing rectifications made before this Authority 
(see Rule 70.16 and Section 607 of the Administrative Instructions under the PCT). 

These annexes consist of a total of sheets. 



3. This report contains indications relating to the following items: 



I 




Basis of the report 


II 


B 


Priority 


III 


B 


Non-establishment of opinion with regard to novelty, inventive step and industrial applicability 


IV 


b 


Lack of unity of invention 


V 




Reasoned statement under Article 35(2) with regard to novelty, inventive step or industrial applicability; 
citations and explanations suporting such statement 


VI 




Certain documents cited 


VII 




Certain defects in the international application 


VIII 




Certain observations on the international application 



Date of submission of the demand 
11/10/2000 


Date of completion of this report 
06.06.2001 


Name and mailing address of the international 
preliminary examining authority: 

European Patent Office 
/fijl D-80298 Munich 

&t* TeL +49 89 2399 - 0 Tx: 523656 epmu d 
Fax: +49 89 2399 - 4465 


Authorized officer /^^******tx 
Page, M (( ^ }) 

Telephone No. +49 89 2399 7322 
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I. Basis f the r p rt 

1 . With regard to the elements of the international application (Replacement sheets which have been furnished to 
the receiving Office in response to an invitation under Article 14 are referred to in this report as "originally filed n 
and are not annexed to this report since they do not contain amendments (Rules 70. 16 and 70. 1 7)): 
Description, pages: 

1 -32 as originally filed 

Claims, No.: 

1 -22 as originally filed 

Drawings, sheets: 

1/6-6/6 as originally filed 

Sequence listing part of the description, pages: 

1-45 (SEQ ID NOs. 1-15), as originally filed 

2. With regard to the language, all the elements marked above were available or furnished to this Authority in the 
language in which the international application was filed, unless otherwise indicated under this item. 

These elements were available or furnished to this Authority in the following language: , which is: 

□ the language of a translation furnished for the purposes of the international search (under Rule 23.1(b)). 

□ the language of publication of the international application (under Rule 48.3(b)). 

□ the language of a translation furnished for the purposes of international preliminary examination (under Rule 
55.2 and/or 55.3). 

3. With regard to any nucleotide and/or amino acid sequence disclosed in the international application, the 
international preliminary examination was carried out on the basis of the sequence listing: 

H contained in the international application in written form. 

□ filed together with the international application in computer readable form. 

□ furnished subsequently to this Authority in written form. 

IS furnished subsequently to this Authority in computer readable form. 

H The statement that the subsequently furnished written sequence listing does not go beyond the disclosure in 

the international application as filed has been furnished. 
H The statement that the information recorded in computer readable form is identical to the written sequence 

listing has been furnished. 

4. The amendments have resulted in the cancellation of: 
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□ 
□ 



the description, 
the claims, 



pages: 
Nos.: 



□ 



the drawings, 



sheets: 



5. □ This report has been established as if (some of) the amendments had not been made, since they have been 

considered to go beyond the disclosure as filed (Rule 70.2(c)): 

(Any replacement sheet containing such amendments must be referred to under item 1 and annexed to this 
report.) 

6. Additional observations, if necessary: 

III. Non-establishment of opinion with regard to novelty, inventive step and industrial applicability 

1. The questions whether the claimed invention appears to be novel, to involve an inventive step (to be non- 
obvious), or to be industrially applicable have not been examined in respect of: 

□ the entire international application. 
SI claims Nos. 22 (partially). 



□ the said international application, or the said claims Nos. relate to the following subject matter which does 
not require an international preliminary examination (specify): 



□ the description, claims or drawings {indicate particular elements beloW) or said claims Nos. are so unclear 
that no meaningful opinion could be formed (specify): 



H the claims, or said claims Nos. 22 (partially) are so inadequately supported by the description that no 
meaningful opinion could be formed. 

□ no international search report has been established for the said claims Nos. . 

2. A meaningful international preliminary examination cannot be carried out due to the failure of the nucleotide 
and/or amino acid sequence listing to comply with the standard provided for in Annex C of the Administrative 
Instructions: 

□ the written form has not been furnished or does not comply with the standard. 

□ the computer readable form has not been furnished or does not comply with the standard. 

IV. Lack of unity f inv nti n 

1 . In response to the invitation to restrict or pay additional fees the applicant has: 



because: 
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□ restricted the claims. 

□ paid additional fees. 

□ paid additional fees under protest. 

□ neither restricted nor paid additional fees. 

2. E9 This Authority found that the requirement of unity of invention is not complied and chose, according to Rule 

68.1 , not to invite the applicant to restrict or pay additional fees. 

3. This Authority considers that the requirement of unity of invention in accordance with Rules 13.1, 13.2 and 13.3 is 

□ complied with. 

IS not complied with for the following reasons: 
see separate sheet 

4. Consequently, the following parts of the international application were the subject of international preliminary 
examination in establishing this report: 

all parts. 

□ the parts relating to claims Nos. . 

V. Reasoned statement under Article 35(2) with regard to novelty, inventive step or industrial applicability; 
citations and explanations supporting such statement 

1. Statement 



Novelty (N) 


Yes: 


Claims 


9-19, 


21, 


22 


(all 


partially) 




No: 


Claims 


1-19, 


21, 


,22 


(all 


partially) 


Inventive step (IS) 


Yes: 


Claims 


9-19, 


21, 


,22 


(all 


partially) 




No: 


Claims 


1-19, 


21, 


,22 


(all 


partially) 


Industrial applicability (IA) 


Yes: 


Claims 


1-22 












No: 


Claims 













2. Citations and explanations 
see separate sheet 

VI. Certain documents cited 

1 . Certain published documents (Rule 70.10) 

and / or 

2. Non-written disclosures (Rule 70.9) 
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s e separat sh et 

VII. Certain defects in the international application 

The following defects in the form or contents of the international application have been noted: 
see separate sheet 

VIII. Certain observations on the international application 

The following observations on the clarity of the claims, description, and drawings or on the question whether the 
claims are fully supported by the description, are made: 
see separate sheet 
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The application concerns the provision of a yeast and plant polypeptide and polynucleotide 
sequences allegedly corresponding to diacylglycerol acyltransferases. Function is shown 
for Saccharomyces cerevisiae sequences, but neither the function nor any structural 
relationship to the Saccharomyces sequences making such a function plausible are 
demonstrated for the other full-length and partial sequences. 

Re Item II 
Priority 

After considering the priority document, the documents cited "P, X" in the search 
report are not considered relevant for the examination of novelty and inventive step. 

Re Item III 

Non-establishment of opinion with regard to novelty, inventive step and industrial 
applicability 

Claim 18 (claim 22 as originally filed) seeks protection for cells or organisms with 
altered PDAT activity, "wherein the altered PDAT activity is characterized by an 
alteration in gene expression, catalytic activity and/or regulation of activity of the 
enzyme". No reference could be found in the description for alterations to the 
catalytic activity or regulation of PDAT activity and claim 18 (partially) is therefore 
considered to lack meaningful support from the description. The claim has only been 
examined with respect to alterations in gene expression. 

Re Item IV 

Lack of Unity of Invention 

An international application must relate to one invention only or to a group of 
inventions so linked as to form a single general inventive concept. Unity of invention 
is fulfilled only when there is a technical relationship between the inventions involving 
one or more of the same or corresponding special technical features. Special 
technical features are such features that define the contribution of the claimed 
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invention over the prior art. 

The identified 8 inventions relate to a group of sequences with the claimed technical 
feature of being diacylglycerol acy transferases as the sole common link. However, 
this feature cannot be considered to constitute a special technical feature because 
it does not define a contribution over the prior art: SEQ ID NOs. 2, 3, 9, 16, 20 and 
22 have all been previously disclosed in their entirety (D1 , D2 and D3). 
Although the prior art does not disclose the function of the encoded enzymes, they 
do disclose the nucleic acid and polypeptide sequences of the respective claimed 
SEQ ID NOs. The encoded enzyme will have the activity claimed in claim 1, 
regardless of whether or not this is disclosed in the prior art. 

The application therefore does not meet the requirements of Rule 13.2 PCT in that 
there is no common special technical feature linking the 8 inventions of the 
application, these being: 

Invention I Claims 5, 6, 8-22 (all partially) and 1-3 (completely) (form rly 
claims 1, 3, 6, 7, 9, 11-27 (all partially) 2 and 4 (completely)) 

Enzymes catalysing the acyl-CoA-independent transfer of fatty acids to diacylglycerol 
in the production of triacyglycerol from Saccharomyces cerevisiae and corresponding 
to polypeptides with SEQ ID NOs. 2, 16, 20 and 22, encoded by polynucleotides SEQ 
ID NOs. 1,19 and 21, fragments, derivatives, alleles, homologs and isoenzymes, the 
corresponding polynucleotide sequences, portions, derivates, alleles and homologs 
of the polynucleotide sequence, expression vectors, transgenic cells and organisms, 
processes for the production of triacylglycerol using such cells/organisms, the 
product of such a process and the use of the enzymes and polynucleotides in such 
processes. 

Invention II Claims 4-6 and 8-22 (all partially) (formerly claims 1, 3, 5-9 and 
11-27 (all partially)) 

As invention I with SEQ ID NOs. 3, 13 and 23 from Schizosaccharomyces pombe. 

Invention III Claims 4-22 (all partially) (formerly claims 1, 3 and 5-27 (all 
partially)) 

As invention I with SEQ ID NOs. 4-6, 18, 24, 25 from Arabidopsis thaliana. 
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Inv ntion IV Claims 4, 5 and 7-22 (all partially) (form rly claims 1 , 3 and 5-27 
(all partially)) 

As invention I with SEQ ID NOs. 7, 8, 26 and 27 from Zea mays. 

Invention V Claims 5 and 7-22 (all partially) (formerly claims 1 , 3, 6-8 and 10- 
27 (all partially)) 

As invention I with SEQ ID NOs. 9 and 28 from Neurospora crassa. 

Invention VI Claims 4-6 and 8-22 (all partially) (formerly claims 1, 3, 5-9 and 
11-27 (all partially)) 

As invention I with SEQ ID NOs. 10, 14, 17 and 29 from Arabidopsis thaliana. 

Invention VII Claims 4-6 and 8-22 (all partially) (formerly claims 1, 3, 5-9 and 
11-27 (all partially)) 

As invention I with SEQ ID NOs. 11,15 and 30 from Arabidopsis thaliana. 

Invention VIII Claims 5 and 7-22 (all partially) (formerly claims 1 , 3 and 5-27 
(all partially)) 

As invention I with SEQ ID NOs. 12 and 31 from Lycopersicon esculentum. 



Re Item V 

Reasoned statement under Rule 66.2(a)(ii) with regard to novelty, inventive step or 
industrial applicability; citations and explanations supporting such statement 

1 ) Reference is made to the following documents: 

D1 : PETER VERHASSELT ET AL: Twelve open reading frames revealed in the 
23.6kb segment flanking the centromere on the Saccharomyces cerevisiae 
chromosome XIV right arm' YEAST, vol. 10, no. 7, July 1994 (1994-07), pages 
1355-1361 , XP0021 12572 -& Swissprot Database Entry Yn84_Yeast Accession 
number P40345; 1 February 1995 XP0021 12574 

D2: DATABASE EMBL [Online] Database Entry SPBC776, 21 January 1999 (1999- 
01-21) LYNE M. ET AL.: 'S. pombe chromosome II cosmid c776' Database 
accession no. AL035263 XP002 150203 

D3: DATABASE EMBL [Online] Database Entry AI398644, 10 February 1999 
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(1999-02-10) XP0021 50204 & MARY ANNE NELSON ET AL: 'Expressed 
sequences from conidial, mycelial, and sexual stages of Neurospora crassa ' 
FUNGAL GENETICS AND BIOLOGY, vol. 21, 1997, pages 348-363, 
XP000952173 

D4: KEITH STOBART ET AL.: Triacylglycerols are synthesized and utilized by 
transacylation reactions in microsomal preparations of developing safflower 
(Carthamus tinctorius L.) seeds' PLANTA, vol. 203, no. 1, 1997, pages 58-66, 
XP0021 12573 

D5: WO 98 55631 A (CALGENE LLC) 10 December 1998 (1998-12-10) 

2) Novelty - ArL33Ml and (2\ PCT : 

Invention I Claims 5, 6, 8 (all partially) and 1-3 (completely) 
Claims 5, 6, 8 (partially), and 1-3 (completely) lack novelty in light of the sequence 
with the accession number P40345 provided by D1 (identified therein as N2042) 
which, according to the description of the present application, encodes an acyl-CoA- 
independent acyltransferase. Although D1 does not disclose the function of the 
encoded enzyme, a polynucleotide or polypeptide sequence is not rendered novel 
by the discovery of its function. The disclosed sequence is 100% identical to SEQ ID 
NO. 2 over the whole length of the protein. 

Inventions ll-VIII Claims 4-8 (all partially) 

Claims 4-8 (partially) lack novelty in light of the sequences provided by D1 , D2 and 
D3 which, according to the description, are polypeptides and polynucleotides 
corresponding to phospholipid:diacylglycerol acyltransferases. As stated previously, 
Identifying the function of known polypeptides does not render the polypeptides 
novel. 

The description, for example, defines a "functional fragment" on page 4 lines 30-32 
as being "any polypeptide sequence which shows specific enzyme activity of a 
PDAT The enzyme N2042 disclosed in D1 clearly possesses such activity and thus 
the claims lack novelty. 

Similarly, allelic variants are understood to be "any different nucleotide sequence 
which encodes a polypeptide with a functionally different function" and having an 
undisclosed number of substitutions, additions or deletions (page 5 lines 28). Again, 
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the protein of D1 clearly fulfills these requirements. 

The definition provided on page 6 lines 17 and 18 for the term "isoenzyme" meets the 
same objections. 

Furthermore, the definition in the description for the term "portion" is meant to include 
any nucleotide sequence which shows specific activity of a PDAT" (page 5 lines 7- 
17). The term includes within its scope the polynucleotide sequences AI398644 of D3 
for example. 

Inventions l-VIII Claims 9-22 

Claims 9-19, 21 and 22 (partially) appear to be novel in light of the cited prior art. 
although polynucleotide and polypeptide sequences according to the claimed 
invention have been disclosed (e.g. D1 sequence N2042, D2 sequence 094680, D3), 
these documents do not disclose gene constructs, vectors, transgenic cells ro the 
use of such in the production of triacylglycerol. 

Claim 20 (partially) lacks novelty in light of D4, which discloses triacylglycerol made 
with an acyl-Co-A independent acyltransferase (D4 page 59 left-hand column 
paragraph 1). Even if the claim were restricted to triacylglycerol made using novel 
subject matter, the Applicant would need to show how this product differs from 
previously disclosed subject matter, as a product is not rendered novel by a new 
method for making it. 

3) Inventive Step - Art.33m and (3) PCT : 

The following comments on inventive step are confined to subject matter which could 
be acknowledged as being novel, or for which novelty could potentially be restored 
as outlined supra. 

Invention I Claims 9-19, 21 and 22 (all partially) 

The closest prior art is document D5, which discloses a the polypeptide and 
polynucleotide sequences for an acyl-Co-A dependent plant diacylglycerol 
acyltransferase as well as the use of the sequences in engineering plants with altered 
triacylglycerol content (D5 page 3 line 22 to page 5 line 20). 
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In the light of the prior art, the technical problem can be regarded as the provision of 
further polynucleotide and polypeptide sequences encoding enzymes that can alter 
the triacylglycerol content of cells or organisms expressing them. 

Claims 9-19, 21 and 22 appear to be inventive in light of the cited prior art, which 
does not disclose the enzyme activity of SEQ ID NO. 2. There is therefore no 
motivation to combine the teaching of D5 with that of D1 disclosing the sequence 
N2042. 

Inventions ll-VIII 9-19, 21 and 22 (all partially) 

Again, the closest prior art is document D5, which discloses a the polypeptide and 
polynucleotide sequences for an acyl-Co-A dependent plant diacylglycerol 
acyltransferase as well as the use of the sequences in engineering plants with altered 
triacylglycerol content (D5 page 3 line 22 to page 5 line 20). 

In the light of the prior art, the technical problem can be regarded as the provision of 
further polynucleotide and polypeptide sequences encoding enzymes that can alter 
the triacylglycerol content of cells or organisms expressing them. 

It cannot be seen how inventive step can be recognised for claims 9-19, 21 and 22. 
Although function has been demonstrated for the enzyme encoded by SEQ ID NO. 
1 , no such function has been demonstrated for the sequences from other species, 
nor has the Applicant shown that there is a structural relationship between the 
sequences of Invention I and those of Inventions ll-VIII that would make such a 
function plausible. This is true for the full-length sequences as well as the partial 
sequences disclosed in the application. 

Re Item VII 

Certain defects in the international application 

a) Contrary to the requirements of Rule 5.1(a)(ii) PCT, the relevant background art 
disclosed in the documents D1-D5 are not mentioned in the description, nor are 
these documents identified therein. 
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R It m VIII 

Certain observations on the international application 

a) Several of the SEQ ID NOs. appear to be identical duplicates of each other, resulting 
in a lack of conciseness as required by Article 6 PCT. The unnecessary duplicates 
should be removed. 
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Applicant's or agent's file reference 

3377/99 PCT 


POP FURTHER SGG Notification of Transmittal of International Search Report 

(Form PCT/ISA/220) as well as, where applicable, item 5 below. 

ACTION 


International application No. 

PCT/EP 00/02701 


International filing date (day/month/year) 

23/03/2000 


(Earliest) Priority Date (day/month/year) 

01/04/1999 


Applicant 

BASF PLANT SCIENCE GmbH 



This International Search Report has been prepared by this International Searching Authority and is transmitted to the applicant 
according to Article 1 8. A copy is being transmitted to the International Bureau. 

This International Search Report consists of a total of 3 sheets. 

[X] It is also accompanied by a copy of each prior art document cited in this report. 



1. 



Basis of the report 

a. With regard to the language, the international search was carried out on the basis of the international application in the 
language in which it was filed, unless otherwise indicated under this item. 

I I the international search was carried out on the basis of a translation of the international application furnished to this 
Authority (Rule 23.1 (b)). 

b. With regard to any nucleotide and/or amino acid sequence disclosed in the international application, the international search 
was carried out on the basis of the sequence listing : 

|Xj contained in the international application in written form. 
| | filed together with the international application in computer readable form, 
furnished subsequently to this Authority in written form, 
furnished subsequently to this Authority in computer readble form. 



2. 
3. 



□ 
E 

m 

□ 
□ 



the statement that the subsequently furnished written sequence listing does not go beyond the disclosure in the 
international application as filed has been furnished. 

the statement that the information recorded in computer readable form is identical to the written sequence listing has been 
furnished 

Certain claims were found unsearchable (See Box I). 
Unity of Invention Is lacking (see Box II). 



5. 



With regard to the title, 

| | the text is approved as submitted by the applicant. 

| X | the text has been established by this Authority to read as follows: 

ENZYMES OF THE BI0SYNTHETIC PATHWAY FOR THE PRODUCTION OF TRIACYLGLYCEROL AND 
RECOMBINANT DNA MOLECULES ENCODING THESE ENZYMES 

With regard to the abstract, 

| X | the text is approved as submitted by the applicant. 

I I the text has been established, according to Rule 38.2(b), by this Authority as it appears in Box III. The applicant may, 
' — I within one month from the date of mailing of this international search report, submit comments to this Authority. 

The figure of the drawings to be published with the abstract is Figure No. J 

|")C] as suggested by the applicant. Q None of the figures. 

[ | because the applicant failed to suggest a figure. 

| | because this figure better characterizes the invention. 
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A. CLASSIFICATION OF SUBJECT MAW*R 

IPC 7 C12N15/54 C12N9/10 C12N15/81 C12N15/82 C12N1/16 
C12N5/10 A01K67/027 C12P7/64 

Accordfng to International Patent Classification (IPC) or to both national classification and IPC 



B. FIELDS SEARCHED 



Minimum documentation searched (classification system followed by classification symbols) 
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A NEW CLASS OF ENZYMES IN THE BIOSYNTHETIC PATHWAY FOR THE 
PRODUCTION OF TR1ACYLGLYCEROL AND RECOMBINANT DNA 
MOLECULES ENCODING THESE ENZYMES 

5 The present invention relates to the isolation, identification and characterization 
of recombinant DNA molecules encoding enzymes catalysing the transfer of 
fatty acids from phospholipids to diacylglycerol in the biosynthetic pathway for 
the production of triacylglycerol. 

10 Triacylglycerol (TAG) is the most common lipid-based energy reserve in nature. 
The main pathway for synthesis of TAG is believed to involve three sequential 
acyl-transfers from acyl-CoA to a glycerol backbone (1, 2). For many years, 
acyl-CoA : diacylglycerol acy (transferase (DAG AT), which catalyzes the third 
acyl transfer reaction, was thought to be the only unique enzyme involved in 

is TAG synthesis. It acts by diverting diacylglycerol (DAG) from membrane lipid 
synthesis into TAG (2). Genes encoding this enzyme were recently identified 
both in the mouse (3) and in plants (4, 5), and the encoded proteins were 
shown to be homologous to acyl-CoA : cholesterol acyltransf erase (ACAT). It 
was also recently reported that another DAGAT exists in the oleaginous fungus 

20 Mortierella ramanniana, which is unrelated to the mouse DAGAT, the ACAT 
gene family or to any other known gene (6). 

The instant invention relates to novel type of enzymes and their encoding 
genes for transformation. More specifically, the invention relates to use of a 

25 type of genes encoding a not previously described type of enzymes hereinafter 
designated phospholipid:diacylglycerol acyltransferases (PDAT), whereby this 
enzyme catalyses an acyl-CoA-independent reaction. The said type of genes 
expressed alone in transgenic organisms will enhance the total amount of oil 
(triacylglycerols) produced in the cells. The PDAT genes, in combination with a 

30 gene for the synthesis of an uncommon fatty acid will, when expressed in 
transgenic organisms, enhance the levels of the uncommon fatty acids in the 
triacylglycerols. 
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There is considerable interest world-wide in producing chemical feedstock, 
such as fatty acids, for industrial use from renewable plant resources rather 
than non-renewable petrochemicals. This concept has broad appeal to 
5 manufacturers and consumers on the basis of resource conservation and 
provides significant opportunity to develop new industrial crops for agriculture. 

There is a diverse array of unusual fatty acids in oils from wild plant species 
and these have been well characterised. Many of these acids have industrial 
10 potential and this has led to interest in domesticating relevant plant species to 
enable agricultural production of particular fatty acids. 

Development in genetic engineering technologies combined with greater 
understanding of the biosynthesis of unusual fatty acids now makes it possible 
15 to transfer genes coding for key enzymes involved in the synthesis of a 
particular fatty acid from a wild species into domesticated oilseed crops. In this 
way individual fatty acids can be produced in high purity and quantities at 
moderate costs. 

20 In all crops like rape, sunflower, oilpalm etc., the oil (i.e. triacylglycerols) is the 
most valuable product of the seeds or fruits and other compounds like starch, 
protein, and fibre is regarded as by-products with less value. Enhancing the 
quantity of oil per weight basis at the expense of other compounds in oil crops 
would therefore increase the value of crop. If genes, regulating the allocation of 

25 reduced carbon into the production of oil can be up-regulated, the cells will 
accumulate more oil on the expense of other products. Such genes might not 
only be used in already high oil producing cells, such as oil crops, but could 
also induce significant oil production in moderate or low oil containing crops 
such as e.g. soy, oat, maize, potato, sugarbeats, and turnips as well as in 

30 micro-organisms. 
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Summary of the invention 

Many of the unusual fatty acids of interest, e.g. medium chain fatty acids, 
hydroxy fatty acids, epoxy fatty acids and acetylenic fatty acids, have physical 

5 properties that are distinctly different from the common plant fatty acids. The 
present inventors have found that, in plant species naturally accumulating 
these uncommon fatty acids in their seed oil (i.e. triacylglycerol), these acids 
are absent, or present in very low amounts in the membrane (phospho)lipids of 
the seed. The low concentration of these acids in the membrane lipids is most 

10 likely a prerequisite for proper membrane function and thereby for proper cell 
functions. One aspect of the invention is that seeds of transgenic crops can be 
made to accumulate high amounts of uncommon fatty acids if these fatty acids 
are efficiently removed from the membrane lipids and channelled into seed 
triacylglycerols. 

15 

The inventors have identified a novel class of enzymes in plants catalysing the 
transfer of fatty acids from phospholipids to diacylglycerol in the production of 
triacylglycerol through an acyl-CoA-independent reaction and that these 
enzymes (phospholipid:diacylglycerol acyltransf erases, abbreviated as PDAT) 
20 are involved in the removal of hydroxylated, epoxygenated fatty acids, and 
probably also other uncommon fatty acids such as medium chain fatty acids, 
from phospholipids in plants. 

This enzyme reaction was shown to be present in microsomal preparations 
from baker's yeast {Saccharomyces cerevisiae). The instant invention further 

25 pertains to an enzyme comprising an amino acid sequence as set forth in SEQ 
ID No. 2 or a functional fragment, derivate, allele, homolog or isoenzyme 
thereof. A so called , knock out' yeast mutant, disrupted in the respective gene 
was obtained and microsomal membranes from the mutant was shown to 
totally lack PDAT activity. Thus, it was proved that the disrupted gene encodes 

30 a PDAT enzyme (SEQ ID NO. 1 and 2). Furtherm, this PDAT enzyme is 
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characterized through the amino acid sequence as set forth in SEQ ID NO 2 
containing a lipase motif of the conserved sequence string FXKWVEA. 
The instant invention pertains further to an enzyme comprising an amino acid 
sequence as set forth in SEQ ID NO. la, 2b or 5a or a functional fragment, 

5 derivate, allele, homolog or isoenzyme thereof. 

Further genes and/or proteins of so far unknown function were identified and 
are contemplated within the scope of the instant invention. A gene from 
Schizosaccharomyces pombe, SPBC776.14 (SEQ ID. NO. 3), a putative open 
reading frame CAA22887 of the SPBC776.14 (SEQ ID NO. 13) were identified. 

10 Further Arabidopsis thaliana genomic sequences (SEQ ID NO. 4, 10 and 11) 
coding for putative proteins were identified, as well as a putative open reading 
frame AAC80628 from the A. thaliana locus AC 004557 (SEQ ID NO. 14) and a 
putative open reading frame AAD10668 from the A. thaliana locus AC 003027 
(SEQ ID NO. 15) were identified. 

15 Also, a partially sequenced cDNA clone from Neurospora crassa (SEQ ID NO. 
9) and a Zea mays EST (Extended Sequence Tac) clone (SEQ ID NO. 7) and 
corresponding putative amino acid sequence (SEQ ID NO. 8) were identified. 
Finally, two cDNA clones were identified, one Arabidopsis thaliana EST (SEQ 
ID NO. 5 and corresponding predicted amino acid sequence SEQ ID NO. 6) 

20 and a Lycopersicon esculentum EST clone (SEQ ID NO. 12) were identified. 
Further, enzymes designated as PDAT comprising an amino acid sequence 
selected from the group consisting of sequences as set forth in SEQ ID NO 2a, 
3a, 5b, 6 or 7b containing a lipase motif FXKWVEA are contemplated within the 
scope of the invention. Moreover, an enzyme comprising an amino acid 

25 sequence encoded through a nucleotide sequence, a portion, derivate, allele or 
homolog thereof selected from the group consisting of sequences as set forth 
in SEQ ID No. 1 , 1 b, 3, 3b, 4, 4a, 4b, 5, 5b, 6b, 7, 8b, 9, 9b, 1 0, 1 0b, 1 1 , 1 1 b or 
12 or a functional fragment, derivate, allele, homolog or isoenzyme of the 
enzyme encoding amino acid sequence are included within the scope of the 

30 invention. 
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A functional fragment of the instant enzyme is understood to be any 
polypeptide sequence which shows specific enzyme activity of a 
phospholipid:diacylglyceroi acyltransf erase (PDAT). The length of the functional 
fragment can for example vary in a range from about 660 ±10 amino acids to 

5 660 ± 250 amino acids, preferably from about 660 ± 50 to 660 ±100 amino 
acids, whereby the „basic number" of 660 amino acids corresponds in this case 
to the polypeptide chain of the PDAT enzyme of SEQ ID NO. 2 encoded by a 
nucleotide sequence according to SEQ ID NO. 1. Consequently, the „basic 
number" of functional fullength enzyme can vary in correspondance to the 

10 encoding nucleotide sequence. 

A portion of the instant nucleotide sequence is meant to be any nucleotide 
sequence encoding a polypeptid which shows specific activity of a 
phospholipid:diacylglycerol acyltransferase (PDAT). The length of the 
nucleotide portion can vary in a wide range of about several hundreds of 

15 nucleotides based upon the coding region of the gene or a highly conserved 
sequence. For example the length varies in a range form about 1900 ± 10 to 
1900 ± 1000 nucleotides, preferably form about 1900 ± 50 to 1900 ±700 and 
more preferably form about 1900 ± 100 to 1900 ± 500 nucleotides, whereby the 
„basic number" of 1 900 nucleotides corresponds in this case to the encoding 

20 nucleotide sequence of the PDAT enzyme of SEQ ID NO. 1 . Consequently, the 
„basic number 14 of functional fullength gene can vary. 

An allelic variant of the instant nucleotide sequence is understood to be any 
different nucleotide sequence which encodes a polypeptide with a functionally 

25 equivalent function. The alleles pertain naturally occuring variants of the instant 
nucleotide sequences as well as synthetic nucleotide sequences produced by 
methods known in the art. Contemplated are even altered nucleotide 
sequences which result in an enzyme with altered activity and/or regulation or 
which is resistant against specific inhibitors. The instant invention further 

30 includes natural or synthetic mutations of the originally isolated nucleotide 
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sequences. These mutations can be substitution, addition, deletion, inversion 
or insertion of one or more nucleotides. 

A homologous nucleotide sequence is understood to be a complementary 
5 sequence and/or a sequence which specifically hybridizes with the instant 
nucleotide sequence. Hybridizing sequences include similar sequences 
selected from the group of DNA or RNA which specifically interact to the instant 
nucleotide sequences under at least moderate stringency conditions which are 
known in the art. A preferred, non-limiting example of stringent hybridization 
10 conditions are hybridization in 6X sodium chloride/sodium citrate (SSC) at 
about 45°C, followed by one or more washes in 0.2 X SSC, 0.1% SDS at 50- 
65°C. This further includes short nucleotide sequences of e.g. 10 to 30 
nucleotides, preferably 12 to 15 nucleotides. Included are also primer or 
hybridization probes. 

15 

A homologous nucleotide sequence included within the scope of the instant 
invention is a sequence which is at least about 40%, preferably at least about 
50 % or 60%, and more preferably at least about 70%, 80% or 90% and most 
preferably at least about 95%, 96%, 97%, 98% or 99% or more homologous to 
20 a nucleotide sequence of SEQ ID NO. 1 . 

All of the aforementioned definitions are true for amino acid seqences and 
functional enzymes and can easily transfered by a person skilled in the art. 

Isoenzymes are understood to be enzymes which have the same or a similar 
25 substrate specifity and/or catalytic activity but a different primary structure. 

In a first embodiment, this invention is directed to nucleic acid sequences that 
encode a PDAT. This includes sequences that encode biologically active 
PDATs as well as sequences that are to be used as probes, vectors for 
30 transformation or cloning intermediates. The PDAT encoding sequence may 
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encode a complete or partial sequence depending upon the intended use. All or 
a portion of the genomic sequence, cDNA sequence, precursor PDAT or 
mature PDAT is intended. 

5 Further included is a nucleotide sequence selected from the group consisting of 
sequences set forth in SEQ ID No. 1 , lb, 3, 3b, 4, 4a, 4b, 9b, 10, 10b or 1 1 or a 
portion, derivate, allele or homolog thereof. The invention pertains a partial 
nucleotide sequence corresponding to a fullength nucleotide sequence selected 
from the group consisting of sequences set forth in SEQ ID No. 5, 5b, 6b, 7, 8b, 

io 9, 11b or 12 or a portion, derivate, allele or homolog thereof. Moreover, a 
nucleotide sequence comprising a nucleotide sequence which is at least 40% 
homologous to a nucleotide sequence selected form the group consisting of 
those sequences set forth in SEQ ID No. 1 1b, 3, 3b, 4, 4a, 4b, 5, 5b, 6b, 7, 8b, 
9, 9b, 10, 10b, 11, 1 1b or 12 is contemplated within the scope of the invention. 

15 

The instant invention pertains to a gene construct comprising a said nucleotide 
sequences of the instant invention which is operably linked to a heterologous 
nucleic acid. 

The term operably linked means a serial organisation e.g. of a promotor, coding 
20 sequence, terminator and/or further regulatory elements whereby each element 
can fulfill its original function during expression of the nucleotide sequence. 

Further, a vector comprising of a said nucleotide sequence of the instant 
invention is contemplated in the instant invention. This includes also an 
25 expression vector as well as a vector further comprising a selectable marker 
gene and/or nucleotide sequences for the replication in a host cell and/or the 
integration into the genome of the host cell. 

In a different aspect, this invention relates to a method for producing a PDAT in 
30 a host cell or progeny thereof, including genetically engineered oil seeds, yeast 
and moulds or any other oil accumulating organism, via the expression of a 
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construct in the cell. Cells containing a PDAT as a result of the production of 
the PDAT encoding sequence are also contemplated within the scope of the 
invention. 

5 Further, the invention pertains a transgenic cell or organism containing a said 
nucleotide sequence and/or a said gene construct and/or a said vector. The 
object of the instant invention is further a transgenic cell or organism which is 
an eucaryotic cell or organism. Preferably, the transgenic cell or organism is a 
yeast cell or a plant cell or a plant. The instant invention further pertains said 

10 transgenic cell or organism having an altered biosynthetic pathway for the 
production of triacylglycerol. A transgenic cell or organism having an altered oil 
content is also contemplated within the scope of this invention. 

Further, the invention pertains a transgenic cell or organism wherein the activity 
15 of PDAT is altered in said cell or organism. This altered activity of PDAT is 
characterized by an alteration in gene expression, catalytic activity and/or 
regulation of activity of the enzyme. Moreover, a transgenic cell or organism is 
included in the instant invention, wherein the altered biosynthetic pathway for 
the production of triacylglycerol is characterized by the prevention of 
20 accumulation of undesirable fatty acids in the membrane lipids. 

In a different embodiment, this invention also relates to methods of using a 
DNA sequence encoding a PDAT for increasing the oil-content within a cell. 

25 Another aspect of the invention relates to the accommodation of high amounts 
of uncomman fatty acids in the triacylglycerol produced within a cell, by 
introducing a DNA sequence producing a PDAT that specifically removes these 
fatty acids from the membrane lipids of the cell and channel them into 
triacylglycerol. Plant cells having such a modification are also contemplated 

30 herein. 
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Further, the invention pertains a process for the production of triacylglycerol, 
comprising growing a said transgenic cell or organism under conditions 
whereby the said nucleotide sequence is expressed and whereby the said 
transgenic cells comprising a said enzyme catalysing the transfer of fatty acids 
5 from phospholipids to diacylglycerol forming triacylglycerol. 

Moreover, triacylglycerols produced by the aforementioned process are 
included in scope of the instant invention. 

10 Object of the instant invention is further the use of an instant nucleotide 
sequence and/or a said enzyme for the production of triacylglycerol and/or 
triacylglycerols with uncommon fatty acids. The use of a said instant nucleotide 
sequence and/or a said enzyme of the instant invention for the transformation 
of any cell or organism in order to be expressed in this cell or organism and 

15 result in an altered, preferably increased oil content of this cell or organism is 
also contemplated within the scope of the instant invention. 

A PDAT of this invention includes any sequence of amino acids, such as a 
protein, polypeptide or peptide fragment obtainable from a microorganism, 
20 animal or plant source that demonstrates the ability to catalyse the production 
of triacylglycerol from a phospholipid and diacylglycerol under enzyme reactive 
conditions. By „enzyme reactive conditions" is meant that any necessary 
conditions are available in an environment (e.g., such factors as temperature, 
pH, lack of inhibiting substances) which will permit the enzyme to function. 

25 

Other PDATs are obtainable from the specific sequences provided herein. 
Furthermore, it will be apparent that one can obtain natural and synthetic 
PDATs, including modified amino acid sequences and starting materials for 
synthetic-protein modelling from the examplified PDATs and from PDATs which 
30 are obtained through the use of such examplified sequences. Modified amino 
acid sequences include sequences that have been mutated, truncated, 
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increased and the like, whether such sequences were partially or wholly 
synthesised. Sequences that are actually purified from plant preparations or 
are identical or encode identical proteins thereto, regardless of the method 
used to obtain the protein or sequence, are equally considered naturally 
5 derived. 

Further, the nucleic acid probes (DNA and RNA) of the present invention can 
be used to screen and recover homologous" or „related" PDATs from a variety 
of plant and microbial sources. 

10 

Further, it is also apparent that a person skilled in the art can, with the 
information provided in this application, in any organism identify a PDAT 
activity, purify an enzyme with this activity and thereby identify a ^on- 
homologous" nucleic acid sequence encoding such an enzyme. 

15 

The present invention can be essentially characterized by the following 
aspects: 

1 . Use of a PDAT gene (genomic clone or cDNA) for transformation. 
20 2. Use of a DNA molecule according to item 1 wherein said DNA is used for 
transformation of any organism in order to be expressed in this organism 
and result in an active recombinant PDAT enzyme in order to increase oil 
content of the organism. 

3. Use of a DNA molecule of item 1 wherein said DNA is used for 
25 transformation of any organism in order to prevent the accumulation of 

undesirable fatty acids in the membrane lipids. 

4. Use according to item 1 , wherein said PDAT gene is used for transforming 
transgenic oil accumulating organisms engineered to produce any 
uncommon fatty acid which is harmful if present in high amounts in 

30 membrane lipids, such as medium chain fatty acids, hydroxylated fatty 
acids, epoxygenated fatty acids and acetyfenic fatty acids. 
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5. Use according to item 1, wherein said PDAT gene is used for transforming 
organisms, and wherein said organisms are crossed with other oil 
accummulating organisms engineered to produce any uncommon fatty acid 
which is harmful if present in high amounts in membrane lipids, comprising 

5 medium chain fatty acids, hydroxylated fatty acids, epoxygenated fatty 

acids and acetylenic fatty acids. 

6. Use according to item 1 , wherein the enzyme encoded by said PDAT gene 
or cDNA is coding for a PDAT with distinct acyl specificity. 

7. Use according to item 1 wherein said PDAT encoding gene or cDNA, is 
io derived from Saccharornyces cereviseae, or contain nucleotide sequences 

coding for an amino acid sequence 30% or more identical to the amino acid 
sequence of PDAT as presented in SEQ. ID. NO. 2. 

8. Use according to item 1 wherein said PDAT encoding gene or cDNA is 
derived from Saccharornyces cereviseae, or contain nucleotide sequences 

15 coding for an amino acid sequence 40% or more identical to the amino acid 

sequence of PDAT as presented in SEQ. ID. NO. 2. 

9. Use according to item 1 wherein said PDAT encoding gene or cDNA is 
derived from Saccharornyces cereviseae, or contain nucleotide sequences 
coding for an amino acid sequence 60% or more identical to the amino acid 

20 sequence of PDAT as presented in SEQ. ID. NO. 2. 

10. Use according to item 1 wherein said PDAT encoding gene or cDNA is 
derived from Saccharornyces cereviseae, or contain nucleotide sequences 
coding for an amino acid sequence 80% or more identical to the amino acid 
sequence of PDAT as presented in SEQ. ID. NO. 2. 

25 11. Use according to item 1 wherein said PDAT encoding gene or cDNA is 
derived from plants or contain nucleotide sequences .coding for an amino 
acid sequence 40% or more identical to the amino acid sequence of PDAT 
from Arabidopsis thaiiana or to the protein encoded by the fullength 
counterpart of the partial Zea mays, Lycopericon esculentum, or 

30 Neurospora crassa cDNA clones. 
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12. Transgenic oil accumulating organisms comprising, in their genome, a 
PDAT gene transferred by recombinant DNA technology or somatic 
hybridization. 

13. Transgenic oil accumulating organisms according to item 12 comprising, in 
5 their genome, a PDAT gene having specificity for substrates with a 

particular uncommon fatty acid and the gene for said uncommon fatty acid. 

14. Transgenic organisms according to item 12 or 13 which are selected from 
the group consisting of fungi, plants and animals. 

15. Transgenic organisms according to item 12 or 13 which are selected from 
10 the group of agricultural plants. 

16. Transgenic organisms according to item 12 or 13 which are selected from 
the group of agricultural plants and where said PDAT gene is expressed 
under the control of a storage organ specific promotor. 

17. Transgenic organisms according to item 12 or 13 which are selected from 
is the group of agricultural plants and where said PDAT gene is expressed 

under the control of a seed promotor. 

18. Oils from organisms according to item 12-17. 

19. A method for altering acyl specificity of a PDAT by alteration of the 
nucleotide sequence of a naturally occurring encoding gene and as a 

20 consequence of this alternation creating a gene encoding for an enzyme 

with novel acyl specifity. 

20. A protein encoded by a DNA molecule according to item 1 or a functional 
fragment thereof. 

21. A protein of item 20 designated phospholipid:diacylglycerol acy [transferase. 
25 22. A protein of item 21 which has a distinct acyl specificity. 

23. A protein of item 13 having the amino acid sequence as set forth in SEQ, 
ID NO. 2, 13, 14 or 15 (and the proteins encoded by the fullength or partial 
genes set forth in SEQ. ID. NO. 1, 3, 4, 5, 7, 9, 10, 1 1 or 12) or an amino 
acid sequence with at least 30 % homology to said amino acid sequence. 
30 24. A protein of item 23 isolated from Saccharomyces cereviseae. 
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General methods: 

Yeast strains and plasmids . The wild type yeast strains used were either 
FY1679 (MATa his3-A200 Ieu2-A1 trp1-A6 ura3-52) or W 303-1 A (MATa ADE2- 

5 1 can1-100 his3-11,15 Ieu2-3,112 trp1-1 ura3-1) (7). The YNR008w::KanMX2 
disruption strain FVKT004-04C(AL), which is congenic to FY1679, was 
obtained from the Euroscarf collection (8). A 2751 bp fragment containing the 
YNR008w gene with 583 bp of 5' and 183 bp of 3' flanking DNA was amplified 
from W303-1A genomic DNA using Taq polymerase with 5'- 

10 TCTCCATCTTCTGCAAAACCT-3' and 5'-CCTGTCAAAAACCTTCTCCTC-3' as 
primers. The resulting PGR product was purified by agarose gei electrophoresis 
and cloned into the EcoRV site of pBluescript (pbiuescript-pdat). For 
complementation experiments, the cloned fragment was released from 
pBluescript by H/ndlll-SacI digestion and then cloned between the H/ndlll and 

15 Sad sites of pFL39 (9), thus generating pUS1 . For overexpression of the PDAT 
gene, a 2202 bp EcoRI fragment from the pBluscript plasmid which contains 
only 24 bp of 5' flanking DNA was cloned into the BamHI site of the GAL1- 
TPK2 expression vector pJN92 (12), thus generating pUS4. 

20 Microsomal preparations. Microsomes from developing seeds of sunflower 

(Helianthus annuus), Ricinus communis and Crepis palaestina were prepared 
using the procedure of Stobart and Stymne (11). To obtain yeast microsomes, 
1g of yeast cells (fresh weight) was re-suspended in 8 ml of ice-cold buffer (20 
mM Tris-CI, pH 7.9, 10 mM MgCI 2 , 1 mM EDTA, 5 % (v/v) glycerol, 1 mM DTT, 

25 0.3 M ammonium sulfate) in a 12 ml glass tube. To this tube, 4 ml of glass 
beads (diameter 0.45-0.5 mm) were added, and the tube was then heavily 
shaken (3 x 60 s) in an MSK cell homogenizer (B. Braun Melsungen AG, 
Germany). The homogenized suspension was centrifuged at 20,000 x g for 15 
min at 6°C and the resulting supernatant was again centrifuged at 100,000 x g 

30 for 2 h at 6°C. The 100,000 x g pellet was resuspended in 0.1 M potassium 
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phosphate (pH 7.2), and stored at -80°C. It is subsequently referred to as the 
crude yeast microsomal fraction. 

Lipid substrates. Radio-labeled ricinoleic (12-hydroxy-9-octadecenoic) and 
5 vernolic (12,13-epoxy-9-octadecenoic) acids were synthesized enzymatically 
from [1- 14 C]oleic acid and [1- 14 C]linoleic acid, respectively, by incubation with 
microsomal preparations from seeds of Ricinus communis and Crepis 
palaestina, respectively (12). The synthesis of phosphatidylcholines (PC) or 
phosphatidylethanolamines (PE) with 14 C-labeled acyl groups in the sn-2 
10 position was performed using either enzymatic (13), or synthetic (14) acylation 
of [ 14 C]oieic, [ 14 C]ricinoleic, or [ 14 C]vernolic acid. Dioleoyi-PC that was labeled 
in the srM position was synthesized from sn-1-[ 14 C]oleoyl-lyso-PC and 
unlabeled oleic acid as described in (14). Sn-1-oleoyl-sn-2-[ 14 C]ricinoleoyl-DAG 
was synthesized from PC by the action of phospholipase C type XI from B. 
15 Cereus (Sigma Chemical Co.) as described in (15). Monovernoloyl- and 
divemoleoyl-DAG were synthesized from TAG extracted from seeds of 
Euphorbia iagascae, using the TAG-lipase (Rizhopus arrhizus, Sigma Chemical 
Co.) as previously described (16). Monoricinoleoyl-TAG was synthesized 
according to the same method using TAG extracted from Castor bean. 

20 

Lipid analysis. Total lipid composition of yeast were determined from cells 
harvested from a 40 ml liquid culture, broken in a glass-bead shaker and 
extracted into chloroform as described by Bligh and Dyer (17), and then 
separated by thin layer chromatography in hexane/diethylether/acetic acid 

25 (80:20:1) using pre-coated silica gel 60 plates (Merck). The lipid areas were 
located by brief exposure to l 2 vapors and identified by means of appropriate 
standards. Polar lipids, sterol-esters and triacylgiycerols, as well as the 
remaining minor lipid classes, referred to as other lipids, were excised from the 
plates. Fatty acid methylesters were prepared by heating the dry excised 

30 material at 85 °C for 60 min in 2% (v/v) sulfuric acid in dry methanol. The 
methyl esters were extracted with hexane and analyzed by GLC through a 50 m 
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x 0.32 mm CP-Wax58-CB f used-silica column (Chrompack), with 
methylheptadecanoic acid as an internal standard. The fatty acid content of 
each fraction was quantified and used to calculate the relative amount of each 
lipid class. In order to determine the total lipid content, 3 ml aliquots from yeast 
5 cultures were harvested by centrifugation and the resulting pellets were washed 
with distilled water and lyophilized. The weight of the dried cells was 
determined and the fatty acid content was quantified by GLC-analyses after 
conversion to methylesters as described above. The lipid content was then 
calculated as nmoi fatty acid (FA) per mg dry weight yeast. 

io 

Enzyme assays. Aliquots of crude microsomal fractions (corresponding to 
10 nmol of microsomal PC) from developing plant seeds or yeast cells were 
lyophilized over night. 14 C-Labeled substrate lipids dissolved in benzene were 
then added to the dried microsomes. The benzene was evaporated under a 

15 stream of N 2 , leaving the lipids in direct contact with the membranes, and 0.1 
ml of 50 mM potassium phosphate (pH 7.2) was added. The suspension was 
thoroughly mixed and incubated at 30°C for the time period indicated, up to 90 
min. Lipids were extracted from the reaction mixture using chloroform and 
separated by thin layer chromatography in hexane/diethylether/acetic acid 

20 (35:70:1.5) using silica gel 60 plates (Merck). The radioactive lipids were 
visualized and quantified on the plates by electronic autoradiography (Instant 
Imager, Packard, US). 

Yeast cultivation. Yeast cells were grown at 28°C on a rotatory shaker in 
25 liquid YPD medium (1% yeast extract, 2% peptone, 2% glucose), synthetic 
medium (18) containing 2% (v/v) glycerol and 2% (v/v) ethanol, or minimal 
medium (19) containing 16 g/l of glycerol. 

The instant invention is further characterized by the following examples which 
30 are not limiting: 
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Acvl-CoA-indeoendent synthesis of TAG by oil seed microsomes. A large 
number of unusual fatty acids can be found in oil seeds (20). Many of these 
fatty acids, such as ricinoleic (21) and vernolic acids (22), are synthesized 
using phosphatidyicholin (PC) with oleoyl or linoleoyl groups esterified to the 

5 sn-2 position, respectively, as the immediate precursor. However, even though 
PC can be a substrate for unusual fatty acid synthesis and is the major 
membrane lipids in seeds, unusual fatty acids are rarely found in the 
membranes. Instead, they are mainly incorporated into the TAG. A mechanism 
for efficient and selective transfer of these unusual acyl groups from PC into 

10 TAG must therefore exist in oil seeds that accumulate such unusual fatty acids. 
This transfer reaction was biochemically characterized in seeds from castor 
bean (Ricinus communis) and Crepis palaestina, plants which accumulate high 
levels of ricinoleic and vernolic acid, respectively, and sunflower (Helianthus 
annuus), a plant which has only common fatty acids in its seed oil. Crude 

15 microsomal fractions from developing seeds were incubated with PC having 
14 C-labeled oleoyi, ricinoleoyl or vemoloyl groups at the sn-2 position. After the 
incubation, lipids were extracted and analyzed by thin layer chromatography. 
We found that the amount of radioactivity that was incorporated into the neutral 
lipid fraction increased linearly over a period of 4 hours (data not shown). The 

20 distribution of [ 14 C]acyl groups within the neutral lipid fraction was analyzed 
after 80 min (Fig. 1). Interestingly the amount and distribution of radioactivity 
between diffferent neutral lipids were strongly dependent both on the plant 
species and on the type of [ 14 C]acyl chain. Thus, sunflower microsomes 
incorporated most of the label into DAG, regardless of the type of [ 14 C]acyI 

25 group. In contrast, R. communis microsomes preferentially incorporated 
[ 14 C]ricinoieoyl and [ 14 C]vernoloyl groups into TAG, while [ 14 C]oleyl groups 
mostly were found in DAG. C. palaestina microsomes, finally, incorporated only 
[ 14 C]vernoiyol groups into TAG, with [ 14 C]ricinoleyl groups being found mostly 
as free fatty acids, and [ 14 C]oleyl groups in DAG. This shows that the high in 

30 vivo levels of ricinoleic acid and vernolic acid in the TAG pool of R. communis 
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and C. paiaestina, respectively, can be explained by an efficient and selective 
transfer of the corresponding acyl groups from PC to TAG in these organisms. 

The in-vitro synthesis of triacylglycerols in microsomal preparations of 
5 developing castor bean is summarized in table 1 . 

PDAT: a novel enzyme that catalyzes acvl-CoA independent synthesis of 
TAG. It was investigated if DAG could serve both as an acyl donor as well as 
an acyl acceptor in the reactions catalyzed by the oil seed microsomes. 

10 Thererfore, unlabeled divernoloyl-DAG was incubated with either s/>1 -oleoyl- 
sr?-2-[ 14 C]ricinoleoyl-DAG or sn-1 -oleoyi-sn-2-[ 14 C]ricinoleoyl-PC in the 
presence of R. communis microsomes. The synthesis of TAG molecules 
containing both [ 14 C]ricinoleoyl and vernoloyl groups was 5 fold higher when 
[ 14 C]ricinoleoyl-PC served as acyl donor as compared to [ 14 C]ricinoleoyl-DAG 

15 (fig.1B). These data strongly suggests that PC is the immediate acyl donor and 
DAG the acyl acceptor in the acyl-CoA-independent formation of TAG by oil 
seed microsomes. Therefore, this reaction is catalyzed by a new enzyme which 
we call phospholipid : diacylglycerol acyltransf erase (PDAT). 

20 PDAT activity in yeast microsomes. Wild type yeast cells were cultivated 

under conditions where TAG synthesis is induced. Microsomal membranes 
were prepared from these cells and incubated with sn-2-[ 14 C]-ricinoleoyl-PC 
and DAG and the 14 C-labeled products formed were analyzed. The PC-derived 
[ 14 C]ricinoleoyl groups within the neutral lipid fraction mainly were found in free 

25 fatty acids or TAG, and also that the amount of TAG synthesized was 
dependent on the amount of DAG that was added to the reaction (Fig.2). The in 
vitro synthesis of TAG containing both ricinoleoyl and vernoloyl groups, a TAG 
species not present in vivo, from exogenous added sn-2-[ 14 C]ricinoleoyl-PC and 
unlabelled vernoloyl-DAG (Fig. 2, lane 3) clearly demonstrates the existence of 

30 an acyl-CoA-independent synthesis of TAG involving PC and DAG as 
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substrates in yeast microsomal membranes. Consequently, TAG synthesis in 
yeast can be catalyzed by an enzyme similar to the PDAT found in plants. 

The PDAT encoding aene in veast 

5 A gene in the yeast genome (YNR008w) is known, but nothing is known about 
the function of YNR008w, except that the gene is not essential for growth under 
normal circumstances. Microsomal membranes were prepared from the yeast 
strain FVKT004-04C(AL) (8) in which this gene with unknown function had been 
disrupted. PDAT activity in the microsomes were assayed using PC with 

io radiolabeled fatty acids at the sn-2 position. The activity was found to be 
completely absent in the disruption strain (Fig. 2 lane 4). Significantly, the 
activity could be partially restored by the presence of YNR008w on the single 
copy plasmid pUS1 (Fig. 2 lane 5). Moreover, acyl groups of 
phosphatidylethanolamine (PE) were efficiently incorporated into TAG by 

15 microsomes from the wild type strain whereas no incorporation occured from 
this substrate in the mutant strain (data not shown). This shows that YNR008w 
encodes a yeast PDAT which catalyzes the transfer of an acyl group from the 
sn-2 position of phospholipids to DAG, thus forming TAG. It should be noted 
that no cholesterol esters were formed from radioactive PC even in incubations 

20 with added ergosterols, nor were the amount of radioactive free fatty acids 
formed from PC affected by disruption of the YNR008w gene (data not shown). 
This demonstrates that yeast PDAT do not have cholesterol ester synthesising 
or phospholipase activities. 

25 Increased TAG content in veast ceils that overexoress PDAT . The effect 

of overexpressing the PDAT-encoding gene was studied by transforming a wild 
type yeast strain with the pUS4 plasmid in which the gene is expressed from 
the galactose-induced GAL1 :TPK2 promoter. Cells containing the empty 
expression vector were used as a control. The cells were grown in synthetic 

30 glycerol-ethanol medium, and expression of the gene was induced after either 2 
hours (early log phase) or 25 hours (stationary phase) by the addition of 
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galactose. The cells were then incubated for another 21 hours, after which they 
were harvested and assays were performed. We found that overexpression of 
PDAT had no significant effect on the growth rate as determined by the optical 
density. However, the total lipid content, measured as pmol fatty acids per mg 

5 yeast dry weight, was 47% (log phase) or 29% (stationary phase) higher in the 
PDAT overexpressing strain than in the control. Furthermore, the polar lipid and 
sterolester content was unaffected by overexpression of PDAT. Instead, the 
elevated lipid content in these cells is entirely due to an increased TAG content 
(Fig. 3A,B). Thus, the amount of TAG was increased by 2-fold in PDAT 

io overexpressing early log phase cells and by 40% in stationary phase cells. It is 
interesting to note that a significant increase in the TAG content was achieved 
by overexpressing PDAT even under conditions (i.e. in stationary phase) where 
DAGAT is induced and thus contributes significantly to TAG synthesis. In vitro 
PDAT activity assayed in microsomes from the PDAT overexpressing strain 

15 was 7-fold higher than in the control strain, a finding which is consistent with the 
increased levels of TAG that we observed in vivo (Fig. 3C). These results 
clearly demonstrate the potential use of the PDAT gene in increasing the oil 
content in transgenic organisms. 

20 Substrate specificity of yeast PDAT. The substrate specificity of yeast 

PDAT was analyzed using microsomes prepared from the PDAT 
overexpressing strain (see Fig. 4). The rate of TAG synthesis, under conditions 
given in figure 4 with di-oleoyl-PC as the acyl-donor, was 0.15 nmol per min 
and mg protein. With both oleoyl groups of PC labeled it was possible, under 

25 the given assay conditions, to detect the transfer of 1 1 pmol/min of [ 14 C]oleoyl 
chain into TAG and the formation of 15 pmol/min of lyso-PC. In microsomes 
from the PDAT-deficient strain, no TAG at all and only trace amounts of lyso- 
PC was detected, strongly suggesting that yeast PDAT catalyses the formation 
of equimolar amounts of TAG and lyso-PC when supplied with PC and DAG as 

30 substrates. The fact that somewhat more lyso-PC than TAG is formed can be 
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explained by the presence of a phospholipase in yeast microsomes, which 
produces lyso-PC and unesterified fatty acids from PC. 

The specificity of yeast PDAT for different acyi group positions was 

5 investigated by incubating the microsomes with di-oleoyl-PC carrying a 
[ 14 C]acyl group either at the sn-1 position (Fig. 4A bar 2) or the sn-2 position 
(Fig. 4A bar 3). We found that the major 14 C-labeled product formed in the 
former case was lyso-PC, and in the latter case TAG. We conclude that yeast 
PDAT has a specificity for the transfer of acyl groups from the sn-2 position of 

10 the phospholipid to DAG, thus forming sn-1 -lyso-PC and TAG. Under the given 
assay conditions, trace amounts of 14 C-labelled DAG is formed from the sn-1 
labeled PC by the reversible action of a CDP-choline : choline 
phosphotransferase. This labeled DAG can then be further converted into TAG 
by the PDAT activity. It is therefore not possible to distinguish whether the 

15 minor amounts of labeled TAG that is formed in the presence of di-oleoyl-PC 
carrying a [ 14 C]acyl group in the sn-1 position, is synthesized directly from the 
sn-1 -labeled PC by a PDAT that also can act on the sn-1 postion, or if it is first 
converted to sn-1 -labeled DAG and then acylated by a PDAT with strict 
selectivity for the transfer of acyl groups at the sn-2 position of PC. Taken 

20 together, this shows that the PDAT encoded by YNR008w catalyses an acyl 
transfer from the sn-2 position of PC to DAG, thus causing the formation of 
TAG and lyso-PC. 

The substrate specificity of yeast PDAT was further analyzed with respect 
25 to the headgroup of the acyl donor, the acyl group transferred and the acyl 
chains of the acceptor DAG molecule. The two major membrane lipids of S. 
cerevisiae are PC and PE, and as shown in Fig. 4B (bars 1 and 2), dioleoyl-PE 
is nearly 4-fold more efficient than dioleoyl-PC as acyl donor in the PDAT- 
catalyzed reaction. Moreover, the rate of acyl transfer is strongly dependent on 
30 the type of acyl group that is transferred. Thus, a ricinoleoyl group at the sn-2 
position of PC is 2.5 times more efficiently transferred into TAG than an oleoyl 
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group in the same position (Fig. 4B bars 1 and 3). In contrast, yeast PDAT has 
no preference for the transfer of vemoloyl groups over oleoyi groups (Fig. 4B 
bars 1 and 4). The acyl chain of the acceptor DAG molecule also affects the 
efficiency of the reaction. Thus, DAG with a ricinoleoyl or a vernoioyl group is a 
5 more efficient acyl acceptor than dioleoyl-DAG (Fig. 4B bars 1 , 5 and 6). Taken 
together, these results clearly show that the efficiency of the P DAT-catalyzed 
acyl transfer is strongly dependent on the properties of the substrate lipids. 

PDAT aenes. Nucleotide and amino acid sequences of several PDAT 
10 genes are given as SEQ ID No. 1 through 15. Futher provisional and/or partial 
sequences are given as SEQ ID NO 1a through 5a and 1b through 11b, 
respectively. One of the Arabidopsis genomic sequences (SEQ ID NO. 4) 
identfied an Arabidopsis EST cDNA clone; T04806. This cDNA clone was fully 
characterised and the nucleotide sequence is given as SEQ ID NO. 5. Based 
15 on the sequence homology of the T04806 cDNA and the Arabidopsis thaliana 
genomic DNA sequence (SEQ ID NO 4) it is apparent that an additional A is 
present at position 417 in the cDNA clone (data not shown). Excluding this 
nucleotide would give the amino acid sequence depicted in SEQ ID NO. 12. 

20 Increased TAG content in seeds of Arabidopsis thaliana that express the 

veast PDAT. For the expression of the yeast PDAT gene in Arabidopsis 
thaliana an EcoRI fragment from the pBluescript-PDAT was cloned together 
with napin promotor (25) into the vector pGPTV-KAN (26). A plasmid 
(pGNapPDAT) having the yeast PDAT gene in the correct orientation was 

25 identified and transformed into Agrobacterium tumefaciens. These bacteria 
were used to transform Arabidopsis thaliana Columbia (C-24) plants using the 
root transformation method (27). Plants transformed with an empty vector were 
used as controls. 

First generation seeds (T1) were harvested and germinated on kanamycin 
30 containing medium. Second generation seeds (T2) were pooled from individual 
plants and their fatty acid contents analysed by quantification of their methyl 
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esthers by gas liquid chromatography after methylation of the seeds with 2% 
sulphuric acid in methanol at 85 °C for 1 ,5 hours. Quantification was done with 
heptadecanoic acid methyl esters as internal standard. 

From the transformation with pGNapPDAT one T1 plant (26-14) gave raise to 
5 seven T2 plants of which 3 plants yielded seeds with statistically (in a mean 
difference two-sided test) higher oil content than seeds from T2 plants 
generated from T1 plant 32-4 transformed with an empty vector (table 2). 
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Description of Figures 
FIG. 1 . 

Metabolism of 14 C-labeled PC into the neutral lipid fraction by plant 
5 microsomes. (A) Microsomes from developing seeds of sunflower, R. 
communis and C. palaestina were incubated for 80 min at 30°C with PC (8 
nmol) having oleic acid in its s/>1 position, and either 14 C-labeied oleic, 
ricinoleic or vernolic acid in its sn-2 position. Radioactivity incorporated in TAG 
(open bars), DAG (solid bars), and unsterified fatty acids (hatched bars) was 
10 quantified using thin layer chromatography followed by electronic 
autoradiography, and is shown as percentage of added labeled substrate. (B) 
Synthesis in vitro of TAG carrying two vemoloyi and one [ 14 C]ricinoleoyl group 
by microsomes from R. communis. The substrates added were unlabeled 
divernoloyl-DAG (5 nmol), together with either sn-1-oleoyl-s/>2-[ 14 C]ricinoleoyl- 
15 DAG (0.4 nmol, 7700 dpm/nmol) or sn-1-oleoyl-SA7-2-[ 14 C]ricinoleoyl-PC (0.4 
nmol, 7700 dpm/nmol). The microsomes were incubated with the substrates for 
30 min at 30°C, after which samples were removed for lipid analysis as 
described in the section ..general methods". The data shown are the average of 
two experiments. 

20 

FIG. 2. 

PDAT activity in yeast microsomes, as visualized bv autoradioqram of neutral 
lipid products separated on TLC . Microsomal membranes (10 nmol of PC) from 
the wild type yeast strain FY1679 (lanes 1-3), a congenic yeast strain 

25 (FVKT004-04C(AL)) that is disrupted for YNR008w (lane 4) or the same 
disruption strain transformed with the plasmid pUS1, containing the YNR008w 
gene behind its native promotor (lane 5), were assayed for PDAT activity. As 
substrates, we used 2 nmol sn-1-oleoyl-sn-2-[ 14 C]ricinoleoyl-PC together with 
either 5 nmol of dioleoyl-DAG (lanes 2, 4 and 5) or rac-oieoyl-vernoleoyl-DAG 

30 (lane 3). The enzymatic assay and lipid analysis was performed as described in 
Materials and Methods. The cells were precultured for 20 h in liquid YPD 
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medium, harvested and re-suspended in an equal volume of minimal medium 
(19) containing 16 g/l glycerol. The cells were then grown for an additional 24 h 
prior to being harvested. Selection for the plasmid was maintained by growing 
the transformed cells in synthetic medium lacking uracil (18). Abbreviations: 1- 
5 OH-TAG, monoricinoleoyi-TAG; 1 -OH-1 -ep-TAG, monoricinoleoyl- 
monovernoioyl-TAG; OH-FA, unesterified ricinoleic acid. 

Fig. 3. 

Lipid content (A.B) and PDAT activity (C) in PDAT overexpressinq yeast cells. 

io The PDAT gene in the plasmid pUS4 was overexpressed from the galactose- 
induced GAL1-TPK2 promoter in the wild type strain W303-1A (7). Its 
expression was induced after (A) 2 hours or (B) 25 hours of growth by the 
addition of 2% final concentration (w/v) of galactose. The cells were then 
incubated for another 22 hours before being harvested. The amount of lipids of 

15 the harvested cells was determined by GLC-analysis of its fatty acid contents 
and is presented as pmol fatty acids per mg dry weight in either TAG (open 
bar), polar lipids (hatched bar), sterol esters (solid bar) and other lipids (striped 
bar). The data shown are the mean values of results with three independent 
yeast cultures. (C) In vitro synthesis of TAG by microsomes prepared from 

20 yeast cells containing either the empty vector (vector) or the PDAT plasmid (+ 
PDAT). The cells were grown as in Fig. 3A. The substrate lipids dioleoyl-DAG 
(2.5 nmol) and s/>1-oleoyl-sn-2-[ 14 C]-oleoyl-PC (2 nmol) were added to aiiquots 
of microsomes (10 nmol PC), which were then incubated for 10 min at 28 °C. 
The amount of label incorporated into TAG was quantified by electronic 

25 autoradiography. The results shown are the mean values of two experiments. 

FIG. 4. 

Substrate specificity of veast PDAT, The PDAT activity was assayed by 
incubating aiiquots of lyophilized microsomes (10 nmol PC) with substrate lipids 
30 at 30°C for 10 min (panel A) or 90 min (panel B). Unlabeled DAG (2.5 nmol) 
was used as substrates together with different labeled phospholipids, as shown 
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in the figure. (A) S/>position specificity of yeast PDAT regarding the acyl donor 
substrate. Dioleoyl-DAG together with either s/>1-[ 14 C]oleoyl-sn-2-[ 14 C]oleoyi- 
PC (di-[ 14 C]-PC), sn-1-[ 14 C]oleoyl-sn-2-oleoyl-PC (sn1-[ 14 C]-PC) or sn-1-oleoyl- 
sn-2-[ 14 C]oleoyl-PC (sn2-[ 14 C]-PC). (B) Specificity of yeast PDAT regarding 

5 phospholipid headgroup and of the acyl composition of the phospholipid as well 
as of the diacylglycerol. Dioleoyl-DAG together with either s/>1-oleoyl-sn-2- 
[ 14 C]oleoyl-PC (oleoyl-PC), sn-1-oleoyl-sn-2-[ 14 C]oleoyl-PE (oleoyl-PE), sn-1- 
oleoyl-sn-2-[ 14 C]ricinoleoyl-PC (ricinoleoyl-PC) or sn-1 -oleoyl-sn-2- 
[ 14 C]vernoloyl-PC (vernoloyl-PC). In the experiments presented in the 2 bars to 

10 the far right, monoricinoleoyl-DAG (ricinoleoyl-DAG or mono-vernoioyl-DAG 
(vernoioyl-DAG) were used together with sn-1-oleoyl-s/>2-[ 14 C]-oleoyl-PC. The 
label that was incorporated into TAG (solid bars) and lyso-PC (LPC, open bars) 
was quantified by electronic autoradiography. The results shown are the mean 
values of two experiments. The microsomes used were from W303-1A cells 

15 overexpressing the PDAT gene from the GAL1-TPK2 promotor, as described in 
Fig. 3. The expression was induced at early stationary phase and the cells were 
harvested after an additional 24 h. 

20 TAB.1: 

In vitro synthesis of triacvlqlvcerols in microsomal preparations of developing 
castor bean. Aliqouts of microsomes (20 nmol PC) were lyophilised and 
substrate lipids were added in benzene solution: (A) 0.4 nmol [ 14 C]-DAG (7760 
dpm/nmol) and where indicated 1.6 nmol unlabelled DAG; (B) 0.4 nmol [ 14 C]- 

25 DAG (7760 dpm/nmol) and 5 nmol unlabelled di-ricinoleoyl-PC and (C) 0.25 
nmol [ 14 C]-PC (4000 dpm/nmol) and 5 nmol unlabelled DAG. The benzene was 
evaporated by N 2 and 0.1 ml of 50 mM potassium phosphate was added, 
thoroughly mixed and incubated at 30 °C for (A) 20 min.; (B) and (C) 30 min.. 
Assays were terminated by extraction of the lipids in chloroform. The lipids 

30 were then separated by thin layer chromatography on silica gel 60 plates 
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(Merck; Darmstadt, Germany) in hexan/diethylether/acetic 35:70:1.5. The 
radioactive lipids were visualised and the radioactivity quantified on the plate by 
electronic autoradiography (Instant Imager, Packard, US). Results are 
presented as mean values of two experiments. 

5 

Radioactivity in different triacylglycerols (TAG) species formed. Abbreviations 
used: 1-OH-, mono-ricinoleoyl-; 2-OH, di-ricinoleoyl-; 3-OH-, triricinoleoyl; 1- 
OH-1 -ver-, mono-ricinoleoly-monovernoleoyt-; 1 -OH-2-ver-, mono-ricinoleoyl- 
divernoleoyk Radiolabelled DAG and PC were prepared enzymatically. The 
10 radiolabelled ricinoleoyl group is attached at the sn-2-position of the lipid and 
unlabelled oleoyl group at the sn-1 -position. Unlabelled DAG with vernoleoyi- or 
ricinoleoyl chains were prepared by the action of TAG lipase (6) on oil of 
Euphorbia lagascae or Castor bean, respectively. Synthetic di-ricinoleoyl-PC 
was kindly provided from Metapontum Agribios (Italy). 

15 

TAB.2: 

Total fatty acids per mg of T2 seeds pooled from individual Arabidopsis thaliana 
plants transformed with yeast PDAT gene under the control of napin promoter 
20 (26-14) or transformed with empty vector (32-4). 

* = stastistical difference between control plants and PDAT transformed plants 
in a mean difference two-sided test at cx = 5. 
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Description of the SEQ ID: 

SEQ ID NO. 1: Genomic DNA sequence and suggested amino acid sequence of 
the Saccharomyces cerevisiae PDAT gene, YNR008w, with GenBank accession 
5 number Z71 623 and Y1 31 39, and with nucleotide ID number 1 302481 . 

SEQ ID NO. 2. The amino acid sequence of the suggested open reading frame 
YNR008w from Saccharomyces cerevisiae. 

10 SEQ ID NO. 3: Genomic DNA sequence of the Schizosaccharomyces pombe gene 
SPBC776.14. 

SEQ ID NO. 4: Genomic DNA sequence of part of the Arabidopsis thaliana locus 
with GenBank accession number AB006704. 

15 

SEQ ID NO. 5: Nucleotide sequence of the Arabidopsis thaliana cDNA clone with 
GenBank accession number T04806, and nucleotide ID number 315966. 

SEQ ID NO. 6: Predicted amino acid sequence of the Arabidopsis thaliana cDNA 
20 clone with GenBank accession number T04806. 

SEQ ID NO. 7: Nucleotide and amino acid sequence of the Zea mays EST clone 
with GenBank accession number AI491339, and nucleotide ID number 4388167. 

25 SEQ ID NO. 8r. Predicted amino acid sequence of the Zea mays EST clone with 
GenBank accession number AI491339, and nucleotide ID number 4388167. 

SEQ ID NO. 9: DNA sequence of part of the Neurospora crassa EST clone 
W07G1, with GenBank accession number AI398644, and nucleotide ID number 
30 4241729. 
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SEQ ID NO. 10: Genomic DNA sequence of part of the Arabidopsis thaliana locus 
with GenBank accession number AC004557. 

SEQ ID NO. 11: Genomic DNA sequence of part of the Arabidopsis thaliana locus 
5 with GenBank accession number AC003027. 

SEQ ID NO. 12. DNA sequencce of part of the Lycopersicon esculentum cDNA 
clone with GenBank accession number AI486635. 

io SEQ ID NO. 13: Amino acrd sequence of the Schizosaccharomyces pombe 
putative open reading frame CAA22887 of the Schizosaccharomyces pombe gene 
SPBC776.14. 

SEQ ID NO. 14: Amino acid sequence of the Arabidopsis thaliana putative open 
15 reading frame AAC80628 derived from the Arabidopsis thaliana locus with 
GenBank accession number AC004557. 

SEQ ID NO 15: Amino acid sequence of the Arabidopsis thaliana putative open 
reading frame AAD10668 derived from the Arabidopsis thaliana locus with 
20 GenBank accession number AC003027. 

Further provisional and/or partial sequences are defined through the followino 
SEQ IDs: 

25 SEQ ID NO. 1a: The amino acid sequence of the yeast ORF YNR008w from 
Saccharomyces cerevisiae. 

SEQ ID NO. 2a: Amino acid sequence of the region of the Arabidopsis thaliana 
genomic sequence (AC004557). 

30 
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SEQ ID NO. 3a: Amino acid sequence of the region of the Arabidopsis thaliana 
genomic sequence (AB006704). 

SEQ ID NO. 4a: The corresponding genomic DNA sequence and amino acid 
5 sequence of the yeast ORF YNR008w from Saccharomyces cerevisiae. 

SEQ ID NO. 5a: The amino acid sequence of the yeast ORF YNR008w from 
Saccharomyces cerevisiae derived form the corresponding genomic DNA 
sequence. 

10 

SEQ ID NO. 1b: Genomic DNA sequence of the Saccharomyces cerevisiae 
PDAT gene, YNR008w, genebank nucleotide ID number 1302481, and the 
suggested YNR008w amino acid sequence. 

15 

SEQ ID NO. 2b: The suggested amino acid sequence of the yeast gene 
YNR008w from Saccharomyces cerevisiae. 

SEQ ID NO. 3b: Genomic DNA sequence of the Schizosaccharomyces pombe 
20 gene SPBC776.14. 

SEQ ID NO. 4b: Genomic DNA sequence of part of the Arabidopsis thaliana 
locus with genebank accession number AB006704. 

25 SEQ ID NO. 5b: Nucleotide sequence and the corresponding amino acid 
sequence of the Arabidopsis thaliana EST-clone with genebank accession 
number T04806, and ID number 315966. 

SEQ ID NO. 6b: Nucleotide and amino acid sequence of the Zea mays cDNA 
30 clone with genebank ID number 4388167. 
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SEQ ID NO. 7b: Amino acid sequence of the Zea mays cDNA clone with 
genebank ID number 4388167. 

SEQ ID NO. 8b: DNA sequence of part of the Neurospora crassa cDNA clone 
5 W07G1 , ID number 4241729. 

SEQ ID NO. 9b: Genomic DNA sequence of part of the Arabidopsis thaliana 
locus with genebank accession number AC004557. 

10 SEQ ID NO. 10b: Genomic DNA sequence of part of the Arabidopsis thaliana 
locus with genebank accession number AC003027. 

SEQ ID NO. 1 1b: DNA sequence of part of the Lycopersicon esculentum cDNA 
clone with genebank accession number AI486635. 

15 
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Claims 

1 . An enzyme catalysing in an acyl-CoA-independent reaction the transfer of 
fatty acids from phospholipids to diacylglycerol in the biosynthetic pathway 

5 for the production of triacylglycerol. 

2. An enzyme according to claim 1 , comprising an amino acid sequence as 
set forth in SEQ ID No. 2 or a functional fragment, derivate, allele, homolog 
or isoenzyme thereof. 

10 

3. An enzyme according to claims 1 or 2 designated as 
phospholipid:diacylglycerol acyltransferase (PDAT). 

4. An enzyme according to claims 1 to 3, comprising an amino acid sequence 
15 as set forth in SEQ ID No. 1a, 2b or 5a or a functional fragment, derivate, 

allele, homolog or isoenzyme thereof. 

5. An enzyme according to claims 1 to 4, comprising an amino acid sequence 
selected from the group consisting of sequences as set forth in SEQ ID No. 

20 2a, 3a, 5b, 6, 7b, 8, 13, 14 or 15 or a functional fragment, derivate, allele, 

homolog or isoenzyme thereof. 

6. An enzyme according to claims 1 to 5, comprising an amino acid sequence 
encoded through a nucleotide sequence, a portion, derivate, allele or 

25 homolog thereof selected from the group consisting of sequences as set 

forth in SEQ ID No. 1, 1b, 3, 3b, 4, 4a, 4b, 5, 5b, 6b, 7, 8b, 9, 9b, 10, 10b, 
11, 11b or 12 or a functional fragment, derivate, allele, homolog or 
isoenzyme of the enzyme encoding amino acid sequence. 

30 7. A nucleotide sequence encoding an enzyme catalysing in an acyl-CoA- 
independent reaction the transfer of fatty acids from phospholipids to 
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diacylglycerol in the biosynthetic pathway for the production of 
triacylglycerol. 

8. A nucleotide sequence according to claim 7 encoding an enzyme 
5 designated as phospholipid:diacylglycerol acyltransferase (PDAT). 

9. A nucleotide sequence according to claims 7 or 8, selected from the group 
consisting of sequences as set forth in SEQ ID No. 1, 1b, 3, 3b, 4, 4a, 4b, 
9b, 10, 10b or 1 1 or a portion, derivate, allele or homolog thereof. 

10 

10. A partial nucleotide sequence corresponding to a fullength nucleotide 
sequence according to claims 7 to 9, selected from the group consisting of 
sequences as set forth in SEQ ID No. 5, 5b, 6b, 7, 8b, 9, 11b or 12 or a 
portion, derivate, allele or homolog thereof. 

15 

1 1 . A nucleotide sequence according to claims 7 to 10, comprising a nucleotide 
sequence which is at least 40% homologous to a nucleotide sequence 
selected form the group consisting of those sequences set forth in SEQ ID 
No. 1, 1b, 3, 3b, 4, 4a, 4b, 5, 5b, 6b, 7, 8b, 9, 9b, 10, 10b, 11, 11b or 12. 

20 

12. A gene construct comprising a nucleotide sequence according to claims 7 
to 1 1 operably linked to a heterologous nucleic acid. 

13. A vector comprising a nucleotide sequence according to claims 7 to 1 1 or a 
25 gene construct according to claim 12. 

14. A vector according to claim 13, which is an expression vector. 

15. A vector according to claims 13 or 14, further comprising a selectable 
30 marker gene and/or nucleotide sequences for the replication in a host cell 

or the integration into the genome of the host cell. 
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16. A transgenic cell or organism containing a nucleotide sequence according 
to claims 7 to 11 and/or a gene construct according to claim 12 and/or a 
vector according to claims 13 to 15. 

5 

17. A transgenic cell or organism according to claim 16 which is an eucaryotic 
cell or organism. 

18. A transgenic cell or organism according to claims 16 or 17 which is a yeast 
10 cell or a plant cell or a plant. 

19. A transgenic cell or organism according to claims 16 to 18 having an 
altered biosynthetic pathway for the production of triacylglycerol. 

15 20. A transgenic cell or organism according to claims 16 to 19 having an 
altered oil content. 

21. A transgenic cell or organism according to claims 16 to 20 wherein the 
activity of PDAT is altered. 

20 

22. A transgenic cell or organism according to claims 16 to 21 wherein the 
altered activity of PDAT is characterized by an alteration in gene 
expression, catalytic activity and/or regulation of activity of the enzyme. 

25 23. A transgenic cell or organism according to claims 16 to 22 wherein the 
altered biosynthetic pathway for the production of triacylglycerol is 
characterized by the prevention of accumulation of undesirable fatty acids 
in the membrane lipids. 

30 24. A process for the production of triacylglycerol, comprising growing a 
transgenic cell or organism according to claims 16 to 23 under conditions 
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whereby the said nucleotide sequence according to claims 7 to 11 is 
expressed. 

25. Triacylglycerols produced by a process according to claim 24. 

5 

26. Use of a nucleotide sequence according to claims 7 to 11 and/or an 
enzyme according to claims 1 to 6 for the production of triacylgiycerol 
and/or triacylglycerols with uncommon fatty acids. 

10 27. Use of a nucleotide sequence according to claims 7 to 11 and/or an 
enzyme according to claims 1 to 6 for the transformation of any cell or 
organism in order to be expressed in this cell or organism and result in an 
altered, preferably increased oil content of this cell or organism. 

15 
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Fig. 1: 
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Lipid synthesis (pmol/min) 
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Tab. 2: 

Tl plant T2 plant number nmol fatty acids per mg seed 
deviation 



32-4 



26-14 



1 


1277 


+11 (n=2) 


4 


1261 


+63 (n=3) 


5 


1369 


+17 (n=3) 


6 


1312 


+53 (n=4) 


7 


1197 


+54 (n=5) 


8 


1240 


+78 (n=4) 


9 


1283 


+54 (n= 5) 


10 


1381 


+35 (n=5) 


1 


1444 


+110 (n=4) 


2 


1617* 


+109 (n=4) 


3 


1374 


+37 (n=2) 


5 


1562* 


+70 (n=4) 


6 


1393 


±77 (n=4) 


7 


1433 


±98 (n=4) 


8 


1581* 


±82 (n=4) 
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Sequence Listing 

<210> 1 
<211> 1986 
<212> genomic DNA 
<213> Saccharomyces cerevisiae 

<221> CDS 
<222> (1) . . (1983) 

<400> 1 

acg ggc aca ccg ttt cga aga aat gcc cag aac caa aag agt gat tec 

Met Gly Thr Leu Phe Arg Arg Asn Val Gin Asn Gin Lys Ser Asp Ser 

15 10 15 

gat gaa aac aat aaa ggg ggc tec get cat aac aag cga gag age aga 
Asp Glu Asn Asn Lys Gly Gly Ser Val His Asn Lys Arg Glu Ser Arg 
20 25 30 

aac cac att cat cat caa cag gga tta ggc cat aag aga aga agg ggc 144 
Asn His He His Kis Gin Gin Gly Leu Gly His Lys Arg Arg Arg Gly 
35 40 45 

att age ggc age gca aaa aga aat gag cgt ggc aaa gat ttc gac agg 192 
He Ser Gly Ser Ala Lys Arg Asn Glu Arg Gly Lys Asp Phe Asp Arg 
50 55 . 60 

aaa aga gac ggg aac ggc aga aaa cgc tgg aga gat tec aga aga ctg 
Lys Arg Asp Gly Asn Gly Arg Lys Arg Trp Arg Asp Ser Arg Arg Leu 
65 70 75 80 

ate ttc act etc ggc gca etc tta ggt gta etc ctg ccg ttt age cet 288 
He Phe lie Leu Gly Ala Phe Leu Gly Val Leu Leu Pro Phe Ser Phe 
85 90 95 

ggc get tat cat gtt cat aac age gat age gac ttg ttt gac aac ttc 33 6 
Gly Ala Tyr His Val His Asn Ser Asp Ser Asp Leu Phe Asp Asn Phe 
100 105 HO 

gta aat tec gac cca ccc aaa gtg tat ccg gat gac tgg aaa gat gee 3 84 
Val Asn Phe Asp Ser Leu Lys Val Tyr Leu Asp Asp Trp Lys Asp Val 
115 120 125 

etc cca caa ggt ata age ccg tec att gat gat acc cag gcc ggc aac 432 
Leu Pro Gin Gly He Ser Ser Phe He Asp Asp He Gin Ala Gly Asn 
130 135 140 



240 
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tac tec ace tct tct tta gat gat etc agt gaa aa't ttt gec gtt ggt 480 
Tyr Ser Thr Ser Ser Leu Asp Asp Leu Ser Glu Asn Phe Ala Val Gly 
145 150 155 160 

aaa caa etc tta cgt gat tat aat ate gag gec aaa cat cct gtt gta 528 
Lys Gin Leu Leu Arg Asp Tyr Asn lie Glu Ala Lys His Pro Val Val 
165 170 175 

atg gtt cct ggt gtc att tct acg gga att gaa age tgg gga gtt att 57 6 
Met Val Pro Gly Val lie Ser Thr Gly lie Glu Ser Trp Gly Val lie 
180 185 190 

gga gac gat gag tgc gat agt tct gcg cat ttt cgt aaa egg ctg tgg 624 
Gly Asp Asp Glu Cys Asp Ser Ser Ala His Phe Arg Lys A.rg Leu Trp 
195 200 205 

gga agt ttt tac atg ctg aga aca atg gtt atg gat aaa gtt tgt tgg 672 
Gly Ser Phe Tyr Met Leu Arg Thr Met Val Met Asp Lys Val Cys Trp 
210 215 220 

ttg aaa cat gta atg tta gat cct gaa aca ggt ctg gac cca ccg aac 72 0 
Leu Lys His Val Met Leu Asp Pro Glu Thr Gly Leu Asp Pro Pro Asn 
225 230 235 240 

ttt acg eta cgt gca gca cag ggc ttc gaa tea act gat tat ttc ate 768 
Phe Thr Leu Arg Ala Ala Gin Gly Phe Glu Ser Thr Asp Tyr Phe He 
245 250 255 

gca ggg tat tgg att tgg aac aaa gtt ttc caa aat ctg gga gta att 816 
Ala Gly Tyr Trp He Trp Asn Lys Val Phe Gin Asn Leu Gly Val He 
260 ■ 265 270 

ggc tat gaa ccc aat aaa atg acg agt get gcg tat gat tgg agg ctt 864 
Gly Tyr Glu Pro Asn Lys Met Thr Ser Ala Ala Tyr Asp Trp Arg Leu 
275 280 285 

gca tat tta gat eta gaa aga cgc gat agg tac ttt acg aag eta aag 912 
Ala Tyr Leu Asp Leu Glu Arg Arg Asp Arg Tyr Phe Thr Lys Leu Lys 
290 295 300 

gaa caa ate gaa ctg ttt cat caa ttg agt ggt gaa aaa gtt tgt tta 9 60 
Glu Gin He Glu Leu Phe His Gin Leu Ser Gly Glu Lys Val Cys Leu 
305 310 315 320 

att gga cat tct atg ggt tct cag att ate ttt tac ttt atg aaa tgg 1008 
He Gly His Ser Met Gly Ser Gin He He Phe Tyr Phe Met Lys Trp 
325 330 335 
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gtc gag gcc gaa ggc cct ctt tac ggt aat ggt g£t cgt ggc egg gtt 1056 
Val Glu Ala Glu Gly Pro Leu Tyr Gly Asn Gly Gly Arg Gly Trp Val 
340 345 350 



aac gaa cac ata gat tea ttc att aat gca gca ggg acg ctt ctg ggc 1104 
Asn Glu His lie Asp Ser Phe lie Asn Ala Ala Gly Thr Leu Leu Gly 
355 360 365 



get cca aag gca gtt cca get eta att 
Ala Pro Lys Ala Val Pro Ala Leu lie 
370 375 

att caa tea aat acg tea gcc atg tat 
lie Gin Leu Asn Thr Leu Ala Met Tyr 
385 390 

aga att gag aga gta aaa atg tta caa 
Arg lie Glu Arg Val Lys Met Leu Gin 
405 



agt ggt gaa atg aaa gat acc 1152 
Ser Gly Glu Met Lys Asp Thr 
380 

ggt ttg gaa aag ttc ttc tea 1200 
Gly Leu Glu Lys Phe Phe Ser 
395 400 

acg egg ggt ggt ata cca tea 1248 
Thr Trp Gly Gly lie Pro Ser 
410 415 



atg eta cca aag gga gaa gag gtc att tgg ggg gat atg aag tea tct 1296 
Met Leu Pro Lys Gly Glu Glu Val lie Trp Gly Asp Met Lys Ser Ser 
420 425 430 



tea gag gat gca ttg aat aac aac act gac aca tac ggc aat ttc att 1344 
Ser Glu Asp Ala Leu Asn Asn Asn Thr Asp Thr Tyr Gly Asn Phe lie 
435 440 445 



cga ttt gaa agg aat acg age gat get ttc aac aaa aat ttg aca atg 13 92 

Arg Phe Glu Arg Asn Thr Ser Asp Ala Phe Asn Lys Asn Leu Thr Met 

450 455 460 

aaa gac gcc att aac atg aca tta teg ata tea cct gaa tgg etc caa 1440 

Lys Asp Ala lie Asn Met Thr Leu Ser lie Ser Pro Glu Trp Leu Gin 
465 470 475 480 



aga aga gta cat gag cag tac teg ttc ggc tat tec aag aat gaa gaa 148 8 
Arg Arg Val His Glu Gin Tyr Ser Phe Gly Tyr Ser Lys Asn Glu Glu 
485 490 495 

gag tta aga aaa aat gag eta cac cac aag cac tgg teg aat cca atg 153 6 
Glu Leu Arg Lys Asn Glu Leu His His Lys His Trp Ser Asn Pro Met 
500 505 510 



gaa gta cca ctt cca gaa get cce cac atg aaa ate tat tgt ata tac 1584 
Glu Val Pro Leu Pro Glu Ala Pro His Met Lys lie Tyr Cys lie Tyr 
515 520 525 
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ggg gtg aac aac cca act gaa agg gca tat gca cat aag gaa gag gat 1632 

Gly Val Asn Asn Pro Thr Glu Arg Ala Tyr Val Tyr Lys Glu Giu Asp 
530 535 540 

gac tec tct get ctg aat ttg acc ate gac tac gaa age aag caa cct 
Asp Ser Ser Ala Leu Asn Leu Thr lie Asp Tyr Glu Ser Lys Gin Pro 
545 550 555 560 

gta ttc etc acc gag ggg gac gga acc gtt ccg etc gtg gcg cat tea 
Val Phe Leu Thr Glu Gly Asp Gly Thr Val Pro Leu Val Ala His Ser 
565 570 575 

atg tgt cac aaa tgg gee cag ggt get tea ccg tac aac cct gec gga 177 
Met Cys His Lys Trp Ala Gin Gly Ala Ser Pro Tyr Asn Pro Ala Gly 
580 585 590 



att aac gtt act att gtg gaa atg aaa cac cag cca gat cga ttc gat 
lie Asn Val Thr lie Val Glu Met Lys His Gin Pro Asp Arg Phe Asp 
595 600 605 

ata cgt ggt gga gca aaa age gec gaa cac gta gac ate etc ggc age 
lie Arg Gly Gly Ala Lys Ser Ala Giu His Val Asp lie Leu Gly Ser 
610 615 620 

gcg gag ttg aac gat tac ate ttg aaa att gca age ggt aat ggc gat 
Ala Glu Leu Asn Asp Tyr lie Leu Lys lie Ala Ser Gly Asn Gly Asp 
625 630 635 640 

etc gtc gag cca cgc caa ttg tct aat ttg age cag tgg gtt tct cag 
Leu Val Glu Pro Arg Gin Leu Ser Asn Leu Ser Gin Trp Val Ser Gin 
645 650 655 

atg ccc ttc cca atg taa 
Met Pro Phe Pro Met 
660 
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<210> 2 
<211> 661 
<212> PRT 

<213> Saccharontyces cerevisiae 
<400> 2 

Met Gly Thr Leu Phe Arg Arg Asn Val Gin Asn Gin Lys Ser Asp Ser 

15 10 15 

Asp Glu Asn Asn Lys Gly Gly Ser Val His Asn Lys Arg Glu Ser Arg 

20 25 30 

Asn His lie His His Gin Gin Gly Leu Gly His Lys Arg Arg Arg Gly 

35 40 45 

lie Ser Gly Ser Ala Lys Arg Asn Glu Arg Gly Lys Asp Phe Asp Arg 

50 55 60 

Lys Arg Asp Gly Asn Gly Arg Lys Arg Trp Arg Asp Ser Arg Arg Leu 
65 70 75 80 

lie Phe lie Leu Gly Ala Phe Leu Gly Val Leu Leu Pro Phe Ser Phe 

85 90 95 

Gly Ala Tyr His Val His Asn Ser Asp Ser Asp Leu Phe Asp Asn Phe 

100 105 110 

Val Asn Phe Asp Ser Leu Lys Val Tyr Leu Asp Asp Trp Lys Asp Val 

115 120 125 

Leu Pro Gin Gly lie Ser Ser Phe lie Asp Asp lie Gin Ala Gly Asn 

130 135 140 

Tyr Ser Thr Ser Ser Leu Asp Asp Leu Ser Glu Asn Phe Ala Val Gly 
145 150 155 160 

Lys Gin Leu Leu Arg Asp Tyr Asn lie Glu Ala Lys His Pro Val Val 

165 170 175 

Met Val Pro Gly Val lie Ser Thr Gly lie Glu Ser Trp Gly Val lie 

180 185 190 

Gly Asp Asp Glu Cys Asp Ser Ser Ala His Phe Arg Lys Arg Leu Trp 

195 200 205 

Gly Ser Phe Tyr Met Leu Arg Thr Met Vai Met Asp Lys Val Cys Trp 

210 215 220 

Leu Lys His Val Met Leu Asp Pro Glu Thr Gly Leu Asp Pro Pro Asn 
225 230 235 240 

Phe Thr Leu Arg Ala Ala Gin Gly Phe Glu Ser Thr Asp Tyr Phe lie 

245 250 255 

Ala Gly Tyr Trp lie Trp Asn Lys Val Phe Gin Asn Leu Gly Val lie 

260 265 270 

Gly Tyr Glu Pro Asn Lys Met Thr Ser Ala Ala Tyr Asp Trp Arg Leu 

275 280 285 

Ala Tyr Leu Asp Leu Glu Arg Arg Asp Arg Tyr Phe Thr Lys Leu Lys 

290 295 300 

Glu Gin lie Glu Leu Phe His Gin Leu Ser Gly Glu Lys Val Cys Leu 
305 310 315 320 

lie Gly His Ser Met Gly Ser Gin lie lie Phe Tyr Phe Met Lys Trp 
325 330 335 
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Val Glu Ala Glu Giy Pro Leu Tyr Gly Asn Gly Gly Arg Gly Trp Val 

340 345 350 

Asn Glu His lie Asp Ser Phe lie Asn Ala Ala Gly Thr Leu Leu Gly 

355 360 365 

Ala Pro Lys Ala Val Pro Ala Leu lie Ser Gly Glu Met Lys Asp Thr 

370 375 380 

lie Gin Leu Asn Thr Leu Ala Met Tyr Gly Leu Glu Lys Phe Phe Ser 
385 390 395 400 

Arg lie Glu Arg Val Lys Met Leu Gin Thr Trp Gly Gly He Pro Ser 

405 410 415 

Met Leu Pro Lys Gly Glu Glu Val He Trp Gly Asp Met Lys Ser Ser 

420 425 430 

Ser Glu Asp Ala Leu Asn Asn Asn Thr Asp Thr Tyr Gly Asn Phe He 

435 440 445 

Arg Phe Glu Arg Asn Thr Ser Asp Ala Phe Asn Lys Asn Leu Thr Met 

450 455 460 

Lys Asp Ala He Asn Met Thr Leu Ser He Ser Pro Glu Trp Leu Gin 
465 470 475 480 

Arg Arg Val His Glu Gin Tyr Ser Phe Gly Tyr Ser Lys Asn Glu Glu 

485 490 495 

Glu Leu Arg Lys Asn Glu Leu His His Lys His Trp Ser Asn Pro Met 

500 505 510 

Glu Val Pro Leu Pro Glu Ala Pro His Met Lys He Tyr Cys He Tyr 

515 520 525 

Gly Val Asn Asn Pro Thr Glu Arg Ala Tyr Val Tyr Lys Glu Glu Asp 

530 535 540 

Asp Ser Ser Ala Leu Asn Leu Thr He Asp Tyr Glu Ser Lys Gin Pro 
545 550 555 560 

Val Phe Leu Thr Glu Gly Asp Gly Thr Val Pro Leu Val Ala His Ser 

565 570 575 

Met Cys His Lys Trp Ala Gin Gly Ala Ser Pro Tyr Asn Pro Ala Gly 

580 585 590 

He Asn Val Thr He Val Glu Met Lys His Gin Pro Asp Arg Phe Asp 

595 600 605 

He Arg Gly Gly Ala Lys Ser Ala Glu His Val Asp He Leu Giy Ser 

610 615 620 

Ala Glu Leu Asn Asp Tyr lie Leu Lys lie Ala Ser Gly Asn Gly Asp 
625 630 635 640 

Leu Val Glu Pro Arg Gin Leu Ser Asn Leu Ser Gin Trp Val Ser Gin 

645 650 655 

Met Pro Phe Pro Met 
660 
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<210> 3 * ' — ■ 

<211> 2312 

<212> genomic DNA 

<213> Schizosaccharomyces pombe 



<400> 3 

ATGGCGTCTT CCAAGAAGAG CAAAACTCAT AAGAAAAAGA AAGAAGTCAA 5 0 

ATCTCCTATC GACTTACCAA ATTCAAAGAA ACCAACTCGC GCTTTGAGTG 100 

AGCAACCTTC AGCGTCCGAA ACACAATCTG TTTCAAATAA ATCAAGAAAA 150 

TCTAAATTTG GAAAAAGATT GAATTTTATA TTGGGCGCTA TTTTGGGAAT 2 00 

ATGCGGTGCT TTTTTTTTCG CTGTTGGAGA CGACAATGCT GTTTTCGACC 2 50 

CTGCTACGTT AGATAAATTT GGGAATATGC TAGGCTCTTC AGACTTGTTT 3 00 

GATGACATTA AAGGATATTT ATCTTATAAT GTGTTTAAGG ATGCACCTTT 3 50 

TACTACGGAC AAGCCTTCGC AGTCTCCTAG CGGAAATGAA GTTCAAGTTG 4 00 

GTCTTGATAT GT AC AATG AG GG AT AT C G AA GTGACCATCC TGTTATTATG 4 50 

GTTCCTGGTG TTATCAGCTC AGGATTAGAA AGTTGGTCGT TTAATAATTG 5 00 

CTCGATTCCT TACTTTAGGA AACGTCTTTG GGGTAGCTGG TCTATGCTGA 550 

AGGCAATGTT CCTTGACAAG CAATGCTGGC TTGAACATTT AATGCTTGAT 600 

AAAAAAACCG GCTTGGATCC GAAGGGAATT AAGCTGCGAG CAGCTCAGGG 65 0 

GTTTGAAGCA GCTGATTTTT TTATCACGGG CTATTGGATT TGGAGTAAAG 7 00 

TAATTGAAAA CCTTGCTGCA ATTGGTTATG AGCCTAATAA CATGTTAAGT 7 50 

GCTTCTTACG ATTGGCGGTT ATCATATGCA AATTTAGAGG AACGTGATAA 80 0 

ATATTTTTCA AAGTTAAAAA TGTTCATTGA GTACAGCAAC ATTGTACATA 850 

AGAAAAAGGT AGTGTTGATT TCTCACTCCA TGGGTTCACA GGTTACGTAC 900 

TATTTTTTTA AGTGGGTTGA AGCTGAGGGC TACGGAAATG GTGGACCGAC 950 

TTGGGTTAAT GAT C AT ATTG AAGCATTTAT AAATGTGAGT CTCGATGGTT 1000 
GTTTGACTAC GTTTCTAACT TTTGAATAGA TATCGGGATC TTTGATTGGA 1050 

GCACCCAAAA CAGTGGCAGC GCTTTTATCG GGTGAAATGA AAGATACAGG 1100 

TATTGTAATT ACATTAAACA TGTTAATATT TAATTTTTGC TAACCGTTTT 1150 

AAGCTCAATT GAATCAGTTT TCGGTCTATG GGTAAGCAAT AAATTGTTGA 1200 

GATTTGTTAC TAATTTACTG TTTAGTTTGG AAAAATTTTT TTCCCGTTCT 1250 

GAGGTATATT CAAAAATACA AATGTGCTCT ACTTTTTCTA ACTTTTAATA 1300 

GAGAGCCATG ATGGTTCGCA CTATGGGAGG AGTTAGTTCT ATGCTTCCTA 13 50 

AAGGAGGCGA TGTTGTATGG GGAAATGCCA GTTGGGTAAG AAATATGTGC 14 00 

TGTTAATTTT TTATTAATAT TTAGGCTCCA GATGATCTTA ATCAAACAAA 14 50 

TTTTTCCAAT GGTGCAATTA TTCGATATAG AGAAGACATT G AT AAGGAC C 1500 

ACGATGAATT TGACATAGAT GATGCATTAC AATTTTTAAA AAATGTTACA 1550 

GATGACGATT TTAAAGTCAT GCTAGCGAAA AATTATTCCC ACGGTCTTGC 1600 
TTGGACTGAA AAAGAAGTGT TAAAAAATAA CGAAATGCCG TCTAAATGGA 16 50 

TAAATCCGCT AGAAGTAAGA ACATTAAAGT TACTAAATTA TACTAACCCA 17 00 

AATAGACTAG TCTTCCTTAT GCTCCTGATA TGAAAATTTA TTGGGTTCAC 17 50 

GGGGTCGGAA AACCAACTGA GAGAGGTTAT TATTATACTA ATAATCCTGA 1800 

GGGGCAACCT GTCATTGATT CCTCGGTTAA TGATGGAACA AAAGTTGAAA 18 50 

ATGTGAGAGA ATTTATGTTT CAAACATTCT ATTAACTGTT TTATTAGGGT 1900 

ATTGTTATGG ATGATGGTGA TGGAACTTTA CCAATATTAG CCCTTGGTTT 1950 

GGTGTGCAAT AAAGTTTGGC AAACAAAAAG GTTTAATCCT GCTAATACAA 2000 

GTATCACAAA TTATGAAATC AAGCATGAAC CTGCTGCGTT TGATCTGAGA 2050 

GGAGGACCTC GCTCGGCAGA ACACGTCGAT ATACTTGGAC ATTCAGAGCT 2100 

AAATGTATGT TCATTTTACC TTACAAATTT CTATTACTAA CTCTTGAAAT 2150 
AAGGAAATTA TTTTAAAAGT TTCATCAGGC CATGGTGACT CGGTACCAAA 2200 

CCGTTATATA TCAGATATCC AGTACGGACA TAAGTTTTGT AGATTGCAAT 2250 

TAACTAACTA ACCGAACAGG GAAATAATAA ATGAGATAAA TCTCGATAAA 23 00 
CCTAGAAATT AA 2312 
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<400> 4 
ATGCCCCTTA 
ATCTGAAGAG 
CTTCCAAATC 
ATCGATTCTT 
TCTTCTCTTC 
CGGAGCGAAT 
AAAGAAGGTC 
CACCGGTGGG 
TTAGAAAACG 
GCTCAACAAT 
TGAGATCACG 
TGTTTGTCTG 
ATTTTCTTAA 
GTAATTGGCT 
TAGCTGCCTT 
ATGCCCTAGA 
GTAGATTTAG 
TGGGAATAGA 
GATAGAGGAC 
CTAGTTCCTT 
TCAATTTCAA 
CATCCTCGT.T 
TATATTAAAA 
TCTTCTTCTT 
AATGTTATTG 
TCAATTATCT 
TTGACAATGA 
TCAGGACTCG 
AGTGCTGATT 
ACATGGCTGC 
TTCTCATCGT 
TTACTTAAGG 
GACTCTTAGC 
GTGGAAAAAA 
CTACATTTTA 
TGGGCCAGAT 
GACCATTTCT 
GCAAAGGATG 
TGATCAGAAC 
TCTAATAACA 
TTGCTGCTTA 
GATAATTCTT 
TAACATTTGT 
GCGCTGGAGC 
TCTTTCAATT 
TAAAGACTCG 
AGAACTTTAC 
GACACCGATA 
ACGCACATGG 
TATGGGGCGG 
AAAAAGCAAA 
TTC C AAG AAA 
AAGTAGCAGA 
AGGACATATA 
ATATCTGTAC 
CCAAATCACA 
TGCTGGGATC 



TTCATCGGAA 
GTGGTGCACG 
CC AC CAT AAG 
GTTGTTGGTT 
CTTTACAACG 
CACGGGTCCT 
TTAAGGCGAA 
CTCGAGCTTT 
TTTGTGGGGT 
TCTCACTCTT 
CACTTGTTGC 
TCTTACTCTT 
GAGACTATTT 
TGGACTATTT 
GGAATTTCAA 
GTCCGTTCAT 
CTTTGTGTAG 
GAAGTTCTAA 
TGTTGCTTTA 
TTTGATCTTT 
AGTTTCACAT 
CTGTTATCCA 
AAAAAGTGTT 
TCGGCTCAGT 
TTGATGGTAA 
CTTTGATTCA 
AACTGGGTTG 
TGGCTGCTGA 
GCTAACCTTG 
ATATGACTGG 
TCTTTCTATT 
CTTAAATATG 
CGTATGAAAA 
AGCAGTTATA 
TGAAGTGGGT 
TGGTGTGCAA 
TGGTGTTC C A 
TTGCAGTTGC 
CTTGGCTCTG 
TTGCTATATT 
TGTAACTGAA 
ACGCATTGCT 
CATACTGTCT 
TTCCATGCTT 
TTCTTGTATA 
TTGGATTAGT 
TTTCTTCGAA 
TATTTAGACT 
GACTCAACAA 
GCTTGATTGG 
AGAACAACGA 
AGTCCTGTTA 
GGCTGCGCCA 
AATCATAATA 
ATTTTATCTG 
CCTGTCGTGA 
AAAGCTATCG 



AAAGCCGACG 
ATGAGGATTC 
AAATCGAACG 
CATTGGGTGT 
CAATGCCTGC 
TTGCCTGACC 
ACATCCTGTT 
GGGAAGGCAA 
GGAACTTTTG 
CCTTTATATT 
TTCTTCAACA 
TACTTT^TTT 
CTGTATGTGT 
CTGTTTGATT 
GTCATCTTAT 
AAC AAGTT AC 
CGTATAATGA 
CTACATCTGT 
TTATTCAACT 
CAGCTCAATG 
CGAGTTTATT 
GCTTTGAACT 
TTGTGGGTTG 
GTTCATGTTT 
CAGTGGTATA 
GGCCTCTATG 
GATCCAGCTG 
CTACTTTGCT 
CACATATTGG 
CGGCTTTCGT 
ATTCTGTTCC 
TTTCATGTTG 
GTAATATAGA 
GTTCCGCATT 
TGAGGCACCA 
AGTATATTAA 
AAAGC TGTTG 
CAGGTATTGA 
GAACTCAAAG 
ATCGCTGCAA 
ACTCTCTTGA 
CTGTGATGAC 
TTTGGAGGGC 
GCATTCTTTA 
TGCATCTATG 
TGCTCTATTA 
AATTGCAGAG 
TCAGACCTTG 
TGTCTATGTT 
TCACCGGAGA 
AAC TTGTGGT 
ACTATGGAAG 
TCTGAGATTA 
AACCTTGTAC 
GTGAAGGGTG 
CGTGTGGACA 
CTGAGTATAA 



GAGAAACCAT 
GCAAAAGAAA 
GAGGAGGGAA 
GTGTGTGTAA 
GAGCTTCCCT 
CGCCCGGTGT 
GTCTTCATTC 
ACAATGCGCT 
GTGAAGTCTA 
GGGATTTGGA 
TCACTCAAAC 
TTTTTTTGAT 
AAGGTAAGCG 
GTTAACTTTA 
TGCCAAATCT 
TTCCTTTACT 
AGTAGTGTTT 
GGAAAGTGTG 
ATGTATATGT 
TGCTTTTCTC 
CACATGTCTT 
CCTCCCGACC 
CATCTTTGTT 
TTGCTATGGT 
GTTGATAGTA 
TTGGGTGGAA 
GTATTAGAGT 
CCTGGCTACT 
ATATGAAGAG 
TTCAGAACAC 
ATGTTACGTT 
AATTAATAGG 
GTTGATGGTT 
CCATGGGGGT 
GCTCCTCTGG 
GGCGGTGATG 
CAGGGCTTTT 
ATATCTGCTT 
TTATTCTACT 
CTGACATTGG 
GATTAGACAA 
CAGTTTCTTA 
ATTGAATTTT 
CCAATTAGCG 
GTCTTTTATT 
GTCACTTGGT 
CGATTGCCCC 
CAGCATGTAA 
AC CG AAGGG A 
AAGGCCACAC 
GAAGCAGGTG 
GATGATATCT 
ATAATATTGA 
ATTTTGTGAT 
CTGTCAAAGG 
GAGTACCATG 
GGTCTACACT 



CGACGCCGCC 

CCACACGAAT 

GTGGTCGTGC 

CCTGGTGGTT 

CAGTATGTAA 

TAAGCTCAAA 

CTGGGATTGT 

GATGGTTTAT 

CAAAAGGTGA 

TTGGATCTGA 

TTTAATTCCA 

GTGAAACGCT 

TTCCAAGGAC 

GGATATAAAA 

GTTGCTAGAC 

GTCGTTGCGT 

TATGTTTTGT 

TTCAGGCTGT 

GTAATTAAAG 

AATTTTTTTC 

GAATTTCGTC 

CTGCTATGGA 

ACGATCTGCA 

AGAGATGGGC 

TCTTAACTAA 

CACATGTCAC 

TCGAGCTGTA 

TTGTCTGGGC 

AAAAATATGT 

AGAGGTTCTT 

TCTTTCTTCA 

TACGTGATCA 

TCTACCAACG 

CTTGTATTTT 

GTGGCGGGGG 

AACATTGGTG 

CTCTGCTGAA 

AT AC TTTTGA 

AAATATCAAT 

TTGATTATTT 

ATGATGAATT 

GCTTCGACGA 

GCTATGGAAA 

TTATTCTGCT 

TCTTCTTAAT 

TCCTTAATAT 

AGGATTCTTA 

TGAGAATGAC 

GGTGACACGA 

CTGTTGTGGG 

AAAACGGAGT 

TTTGGGAAAG 

TTTTCGAGTA 

TGTATGATGA 

TCAGAGTATC 

ACATGGGAAT 

GCTGGTGAAG 



1000 



50 

100 

150 

200 

250 

300 

350 

400 

450 

500 

550 

600 

650 

700 

750 

800 

850 

900 

950 

1050 
1100 
1150 
1200 
1250 



1300 
1350 
1400 
1450 
1500 
1550 

1650 
1700 
1750 
1800 
1850 
1900 
1950 
2000 
2050 
2100 



1600 



2150 
2200 
2250 
2300 
2350 
2400 
2450 
•2500 
2550 
2600 
2650 
2700 
2750 
2800 
2850 
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CTATAGATC? ACTACATTAT GTTGCTCCTA AGATGATGGC GCGTGGTGCTC ZSO'O*" 

GCTCATTTCT CTTATGGAAT TGCTGATGAT TTGGATGACA CCAAGTATCA 2950 

AGATCCCAAA TACTGGTCAA ATCCGTTAGA GACAAAGTAA GTGATTTCTT 3 000 

GATTCCAACT GTATCCTTCG TCCTGATGCA TTATCAGTCT TTTTGTTTTC 3 050 

GGTCTTGTTC- GATATGGTTT TCAGCTCAAA GCTTACAAAG CTGTTTCTGA 3100 

GCCTTTCTCA AAAAGGCTTG CTCAGTAATA TTGAGGTGCT AAAGTTGATA 315 0 

CATGTGACTC TTGCTTATAA ATCCTCCGTT TGGTTTGTTC TGCTTTTTCA 3200 

GATTACCGAA TGCTCCTGAG ATGGAAATCT ACTCATTATA CGGAGTGGGG 3 25 0 
ATACCAACGG AACGAGCATA CGTATACAAG CTTAACCAGT CTCCCGACAG 3300 

TTGCATCCCC TTTCAGATAT TCACTTCTGC TCACGAGGAG G AC G AAG AT A 3350 

GCTGTCTGAA AGCAGGAGTT TACAATGTGG ATGGGGATGA AACAGTACCC 34 00 

GTCCTAAGTG CCGGGTACAT GTGTGCAAAA GCGTGGCGTG GCAAGACAAG 3450 

ATTCAACCCT TCCGGAATCA AG AC TT AT AT AAGAGAATAC AATCACTCTC 3500 

CGCCGGCTAA CCTGTTGGAA GGGCGCGGGA CGCAGAGTGG TGCCCATGTT 3550 

GATATCATGG GAAACTTTGC TTTGATCGAA GATATCATGA GGGTTGCCGC 3600 

CGGAGGTAAC GGGTCTGATA TAGGACATGA CCAGGTCCAC TCTGGCATAT 3 650 
TTGAATGGTC GGAGCGTATT GACCTGAAGC TGTGA 3 685 
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<210> 5 ~ 
<211> 2427 
<212> cDNA 

<213> Arabidopsis thaliana 



<400> 5 

AGAAACAGCT CTTTGTCTCT CTCGACTGAT CTAACAATCC CTAATCTGTG 50 

TTCTAAATTC CTGGACGAGA TTTGACAAAG TCCGTATAGC TTAACCTGGT 100 

TTAATTTCAA GTGACAGATA TGCCCCTTAT TCATCGGAAA AAGCCGACGG 150 

AGAAACCATC GACGCCGCCA TCTGAAGAGG TGGTGCACGA TGAGGATTCG 200 

CAAAAGAAAC CACACGAATC TTCCAAATCC CACCATAAGA AATCGAACGG 250 

AGGAGGGAAG TGGTCGTGCA TCGATTCTTG TTGTTGGTTC ATTGGGTGTG 300 

TGTGTGTAAC CTGGTGGTTT CTTCTCTTCC TTTACAACGC AATGCCTGCG 350 

AGCTTCCCTC AGTATGTAAC GGAGCGAATC ACGGGTCCTT TGCCTGACCC 400 
GCCCGGTGTT AAGCTCAAAA AAAGAAGGTC TTAAGGCGAA ACATCCTGTT 450 

GTCTTCATTC CTGGGATTGT CACCGGTGGG CTCGAGCTTT GGGAAGGCAA 500 

ACAATGCGCT GATGGTTTAT TTAGAAAACG TTTGTGGGGT GGAACTTTTG 5 50 

GTGAAGTCTA CAAAAGGCCT CTATGTTGGG TGGAACACAT GTCACTTGAC 600 

AATGAAACTG GGTTGGATCC AGCTGGTATT AGAGTTCGAG CTGTATCAGG 650 

ACTCGTGGCT GCTGACTACT TTGCTCCTGG CTACTTTGTC TGGGCAGTGC 7 00 

TGATTGCTAA CCTTGCACAT ATTGGATATG AAGAGAAAAA TATGTACATG 7 50 

GCTGCATATG ACTGGCGGCT TTCGTTTCAG AACACAGAGG TACGTGATCA 800 

GACTCTTAGC CGTATGAAAA GTAATATAGA GTTGATGGTT TCTACCAACG 850 

GTGGAAAAAA AGCAGTTATA GTTCCGCATT CCATGGGGGT CTTGTATTTT 900 

CTACATTTTA TGAAGTGGGT TGAGGC AC C A GCTCCTCTGG GTGGCGGGGG 950 
TGGGCCAGAT TGGTGTGCAA AGTATATTAA GGCGGTGATG AACATTGGTG 1000 

GACCATTTCT TGGTGTTCCA AAAGCTGTTG CAGGGCTTTT CTCTGCTGAA 1050 

GCAAAGGATG TTGCAGTTGC CAGAGCGATT GCCCCAGGAT TCTTAGACAC 1100 

CGATATATTT AGACTTCAGA CCTTGCAGCA TGTAATGAGA ATGACACGCA 115 0 

CATGGGACTC AACAATGTCT ATGTTACCGA AGGGAGGTGA CACGATATGG 1200 

GGCGGGCTTG AT TGGTC AC C GGAGAAAGGC CACACCTGTT GTGGGAAAAA 1250 

GCAAAAGAAC AACGAAACTT GTGGTGAAGC AGGTGAAAAC GGAGTTTCCA 1300 

AGAAAAGTCC TGTTAACTAT GGAAGGATGA TATCTTTTGG GAAAGAAGTA 13 50 

GCAGAGGCTG CGCCATCTGA GATTAATAAT ATTGATTTTC GAGGTGCTGT 14 00 

CAAAGGTCAG AGTATCCCAA ATCACACCTG TCGTGACGTG TGGACAGAGT 14 50 

ACCATGACAT GGGAATTGCT GGG ATCAAAG CTATCGCTGA GTATAAGGTC 1500 

TACACTGCTG GTGAAGCTAT AGATCTACTA CATTATGTTG CTCCTAAGAT 1550 

GATGGCGCGT GGTGCCGCTC ATTTCTCTTA TGGAATTGCT GATGATTTGG 1600 

ATGACACCAA GTATCAAGAT CCCAAATACT GGTCAAATCC GTTAGAGACA 1650 

AAATT AC CG A ATGCTCCTGA GATGGAAATC TACTCATTAT ACGGAGTGGG 17 00 

GATACCAACG GAACGAGCAT AC GT AT AC AA GCTTAACCAG TCTCCCGACA 17 50 

GTTGCATCCC CTTTCAGATA TTCACTTCTG CTCACGAGGA GGACGAAGAT 1800 

AGCTGTCTGA AAGCAGGAGT TTACAATGTG GATGGGGATG AAACAGTACC 1850 

CGTCCTAAGT GCCGGGTACA TGTGTGCAAA AGCGTGGCGT GGCAAGACAA 1900 

GATTCAACCC TTCCGGAATC AAGACTTATA TAAGAGAATA CAATCACTCT 1950 

CCGCCGGCTA ACCTGTTGGA AGGGCGC GGG ACGCAGAGTG GTGCCCATGT 2 00 0 

TGATATCATG GGAAACTTTG CTTTGATCGA AGATATCATG AGGGTTGCCG 2 050 

CCGGAGGTAA CGGGTCTGAT ATAGGACATG ACCAGGTCCA CTCTGGCATA 2100 

TTTGAATGGT CGGAGCGTAT TGACCTGAAG CTGTGAATAT CATGATCTCT 2150 

TTAAGCTGTC CTGTCAGCTT ATGTGAATCC AATACTTTGA AAGAGAGATC 2200 

ATCATCAATT CATCATCATC GTCATCATCA TGATGCTCAA CTCACAAAGA 2250 

AGCCTGAGAA TGATACTTTG GTgCGAAATT CTCAATACCT CTTTAATATT 2300 

CTTATTGAAT GT AAATT ATA CAATCCTATC TAATGTTTGA ACGATAACAC 2350 

AAAACTTGCT GCNGCCATGT TTGTTTGTCT TGTCAAAAGC ATCAATTTGT 24 00 
GGGTTAAAAA AAAAAAAAAA AAAAAAA 2427 
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<210>~ 6 
<211> 671 
<212> PRT 

<213> Arabidopsis thaliana 



<400> 6 

MPLIHRKKPT EKPSTPPSEE WHDEDSQKK 
IDSCCWFIGC VCVTWWFLLF LYNAMPASFP 
KEGLKAKHPV VFIPGIVTGG LELWEGKQCA 
LCWVEHMSLD NETGLDPAGI RVRAVSGLVA 
IGYEEKNMYM AAYDWRLSFQ NTEVRDQTLS 
VPHSMGVLYF LHFMKWVEAP APLGGGGGFD 
KAVAGLFSAE AKDVAVARAI APGFLDTDIF 
MLPKGGDTIW GGLDWSPEKG HTCCGKKQKN 
GRMI SFGKEV AEAAPSEINN IDFRGAVKGQ 
GIKAIAEYKV YTAGEAIDLL HYVAPKMMAR 
PKYWSNPLET KLPNAPEKEI YSLYGVGIPT 
FTSAKEEDED SCLKAGVYNV DGDETVPVLS 
KTYIREYNHS PPANLLEGRG TQSGAHVDIM 
IGHDQVKSGI FEWSERIDLK L 



PHESSKSHHK KSNGGGKWSC 50 

QYVTERITGP LPDPPGVKLK 100 

DGLFRKRLWG GTFGEVYKRP 150 

ADYFAPGYFV WAVLIANLAH 200 

RMKSNIELMV STNGGKKAVI 250 

WCAKYIKAVM NIGGPFLGVP 300 

RLQTLQHVMR MTRTWDSTMS 350 

NETCGEAGEN GVSKKSPVNY 400 

SIPNHTCRDV WTEYHDMGIA 450 

GAAHFSYGIA DDLDDTKYQD 500 

ERAYVYKLNQ SPDSCIPFQI 550 

AGYMCAKAWR GKTRFNPSGI 600 

GNFALIEDIM RVAAGGNGSD 650 
671 
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<210> 7 
<211> 643 - 
<212> cDNA 
<213> Zea mays 

<221> CDS 
<222> (1) . . (402) 

<400> 6 

CGG GAG AAA ATA GCT GCT TTG AAG GGG GGT GTT TAG TTA GCC GAT GGT 4 8 

Arg Glu Lys lie Ala Ala Leu Lys Gly Gly Val Tyr Leu Ala Asp Gly 
1 5 10 15 

GAT GAA ACT GTT CCA GTT CTT AGT GCG GGC T?AC ATG TGT GCG AAA GGA 96 
Asp Glu Thr Val Pro Val Leu Ser Ala Gly Tyr Met Cys Ala Lys Gly 
20 25 30 

TGG CGT GGC AAA ACT CGT TTC AGC CCT GCC GGC AGC AAG ACT TAC GTG 144 
Trp Arg Gly Lys Thr Arg Phe Ser Pro Ala Gly Ser Lys Thr Tyr Val 
35 40 45 

AGA GAA TAC AGC CAT TCG CCA CCC TCT ACT CTC CTG GAA GGC AGG GGC 192 
Arg Glu Tyr Ser His Ser Pro Pro Ser Thr Leu Leu Glu Gly Arg Gly 
50 55 60 



PCT/EPOO/02701 



ACC CAG AGC GGT GCA CAT GTT GAT ATA ATG GGG AAC TTT GCT CTA ATT 240 
Thr Gin Ser Gly Ala His Val Asp lie Met Gly Asn Phe Ala Leu lie 
65 70 75 80 

GAG GAC GTC ATC AGA ATA GCT GCT GGG GCA ACC GGT GAG GAA ATT GGT 2 88 
Glu Asp Val He Arg He Ala Ala Gly Ala Thr Gly Glu Glu He Gly 

85 90 95 

GGC GAT CAG GTT TAT TCA GAT ATA TTC AAG TGG TCA GAG AAA ATC AAA 33 6 
Gly Asp Gin Val Tyr Ser Asp lie Phe Lys Trp Ser Glu Lys He Lys 
100 105 HO 

TTG AAA TTG TAA CCTATGGGAA GTTAAAGAAG TGCCGACCCG TTTATTGCGTTCC 3 91 
Leu Lys Leu 
115 

AAAGTGTCCT GCCTGAGTGC AAC T C TGG AT TTTGCTTAAA TATTGTAATT TTTCACGC 449 
TTCATTCGTC CCTTTGTCAA ATTTACATTT GACAGGACGC C AATGC GAT A CGATGTTG 507 
TACCGCTATT TTCAGCATTG TATATTAAAC TGTACAGGTG TAAGTTGCAT TTGCCAGC 5 65 
TGAAATTGTG TAGTCGTTTT CTTTACGATT TAATANCAAG TGGCGGAGCA GTGCCCCA 62 3 
AGCNAAAAAA AAAAAAAAAA 643 
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< 21-0-^-8 — 

<211> 115 
<212> PRT 
<213> Zea mays 

<400> 8 

Arg Glu Lys lie Ala Ala Leu Lys Gly Gly Val Tyr Leu Ala Asp Gly 
15 10 15 

Asp Glu Thr Val Pro Val Leu Ser Ala Gly Tyr Met Cys Ala Lys Gly 
20 25 30 

Trp Arg Gly Lys Thr Arg Phe Ser Pro Ala Gly Ser Lys Thr Tyr Val 
35 40 45 

Arg Glu Tyr Ser His Ser Pro Pro Ser Thr Leu Leu Glu Gly Arg Gly 
50 55 60 

Thr Gin Ser Gly Ala His Val Asp He Mec Gly Asn Phe Ala Leu He 
65 70 75 80 

Glu Asp Val He Arg He Ala Ala Gly Ala Thr Gly Glu Glu He Gly 
85 90 95 

Gly Asi3 Gin Val Tyr Ser Asp He Phe Lys Trp Ser Glu Lys He Lys 
100 105 HO 



Leu Lys Leu 
115 
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<2T0>-9 

<211> 616 
<212> cDNA 

<213> Neurospora crassa 



<400> 9 

ggcggcgaag acganggcgg 
agatggatct accctctaga 
gtntacngtt tntatgggta 
tggcgcccga tcccgggacg 
actttgactn aggggcacat 
cggcacagtg aaccttatga 
aaatgaagag atacaatcct 
ccgcatgaac cagaacggtt 
tcacgtggac attctaggaa 
tggcggcagg tcgaggcgac 
cttaaacatg tagaaaaggt 
acataggtta ctcaatagta 
aaaaaaaaaa aaaaaa 



aagttggagg ctaacgagaa 
gacacgacta cciittgcacc 
ggaagc cgac ggagcgagc c 
acaacgcatc tttagatgac 
tgaccacggt gtgactttgg 
gtttggggna cctgcgcaat 
gcgggcccaa aaataaccgt 
caatccgaga ggagggccga 
ggcagaatct aaacgagtac 
acaartgagg actttattac 
tgaaatttat gaagagtaat 
tgactaatta aaaaaaaatc 

616 



tgacnctcgg 


50 


cagccccaag 


100 


tacatctatc 


150 


gatcgatacg 


200 


gcgaaggcga 


250 


aaggggtgga 


300 


ggtcgagang 


350 


atacggcgga 


400 


attcttaaag 


450 


tagtaatatt 


500 


taaatacggc 


550 


tttcttccaa 


600 
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<2ro>~TO 

<211> 1562 

<212> genomic DNA 

<213> Arabidopsis thai i ana 



<400> 10 

ATGAAAAAAA TATCTTCACA TTATTCGGTA 
GGTGACGATG ACCTCGATGT GTCAAGCTGT 
TGATTCTGGT TCCAGGAAAC GGAGGTAACC 
AG AG AAT AC A AGCCAAGTAG TGTCTGGTGT 
TCATAAGAAG AGTGGTGGAT GGTTTAGGCT 
TATTGTCTCC CTTCACCAGG TGCTTCAGCG - 
GACCCTGATT TGGATGATTA CCAAAATGCT 
TCCTCATTTC GGTTCGACCA AATCACTTCT 
GGTTAGT AC T TTCCAAGATA TATCATTTTG 
AAATAGACAT AAATTTGGGG GATTATTGTT 
GCTAGTCGGT AATGTGAGTG TTATGTTAGT 
GTGATTTTCC ATTTTAAATG AAGCTAGAAA 
CTATGTCATG AGAATTATAA GGACACTATG 
GGTTTGATTT GCAGAGATGC CACATCTTAC 
TCTAGAGAAA AAATGCGGGT ATGTTAACGA 
CATATGATTT CAGGTACGGC CTGGCTGCTT 
GCCTCACAGT TCCTACAAGA CCTCAAACAA 
CGAGAACGAA GGAAAGCCAG TGATACTCCT 
TTTTCGTCCT CCATTTCCTC AACCGTACCA 
TACATCAAAC ACTTTGTTGC ACTCGCTGCG 
TCAGATGAAG ACATTTGCTT CTGGCAACAC 
ACCCTTTGCT GGTCAGACGG CAT C AG AGG A 
CTACTTCCAT CTACCAAAGT GTTTCACGAC 
AACTCCCCAG GTTAACTACA CAGCTTACGA 
ACATTGGATT CTCACAAGGA GTTGTGCCTT 
TTAACAGAGG AGCTGAT GAC TCCGGGAGTG 
GAGAGGAGTT GATACACCGG AGGTTTTGAT 
ATAAGCAACC AGAGATTAAG TATGGAGATG 
GCGAGCTTAG CAGCTTTGAA AGTCGATAGC 
TGGAGTTTCG CATACATCTA TACTTAAAGA 
TTATGAAGCA GATTTCAATT ATTAATTATG 
GTCAATGAAT GA 



GTCATAGCGA 


TACTCGTTGT 


50 




GGGTAGCAAC 


GTGTACCCTT 




100 


AGCTAGAGGT 


AC GGC TGG AC 




150 


AGCAGCTGGT 


TATATCCGAT 




200 


ATGGTTCGAT 


GCAGCAGTGT 




250 


ATCGAATGAT 


GTTGTACTAT 




300 


CCTGGTGTCC 


AAACCCGGGT 




350 


AT AC CTCGAC 


CCTCGTCTCC 




400 


GGACATTTGC 


ATAATGAACA 




450 


ATATCAATAT 


CCATTTATAT 




500 


ATAGTTAATG 


TGAGTGTTAT 




550 


GTTGTCGTTT 


AATAATGTTG 




600 


TAAATGTAGC 


TT AAT AAT AA 




650 


ATGGAACATT 


TGGTGAAAGC 




700 


CCAAACCATC 


CTAGGAGCTC 




750 


CGGGCCACCC 


GTCCCGTGTA 




800 


TTGGTGGAAA 


AAAC TAGC AG 




850 


CTCCCATAGC 


C T AGG AGG AC 




900 


CCCCTTCATG 


GCGCCGCAAG 




950 


CCATGGGGTG 


GGACGATCTC 




1000 


ACTCGGTGTC 


CCTTTAGTTA 




1050 


CCTCCGAGAG 


TAACCAATGG 




1100 


AGAACTAAAC 


CGCTTGTCGT 




1150 


GATGGATCGG 


TTTTTTGCAG 




1200 


AC AAG AC AAG 


AGTGTTGCCT 




1250 


CCAGTCACTT 


GCATATATGG 




1300 


GTATGGAAAA 


GGAGGATTCG 




1350 


GAGATGGGAC 


GGTTAATTTG 




1400 


TTGAACACCG 


TAGAGATTGA 




1450 


CGAGATCGCA 


CTTAAAGAGA 




1500 


AATTAGCCAA 


TGTTAATGCC 




1550 



1562 
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<210-> "1-1 • * 

<211> 3896 

<212> genomic DNA 

<213> Arabidopsis chaliana 



<400> 11 

ATGGGAGCGA ATTCGAAATC AGTAACGGCT 
TTTTTTCTTG ATTTGCGGTG GCCGAACTGC 
TTCACGGCGA CTACTCGAAG C T ATCGGGT A 
TCGACGCAGC TACGAGCGTG GTCGATCCTT 
GGACTTCAAT CCGCTCGACC TCGTATGGCT 
ATCTTCATTT CCTTCGCTCC TTATTCTGTC 
AATTCCAAGC GAAATATAGC AATGAAGCAT 
TCGTTCATTA GTCAACAGTG ACGCTTCTGA 
AAAACAGCTG ACTCGGCGAG TGTTTCCCAT 
TAGCGCAATG AATGTGTAAT TAGTCTGCGC 
CAAGTTTTTC AGAGTGCTCA ATAGTAGTTA 
CTTGTGCATT GTGATTCTTT TGGTTGTTGC 
TGGTTTACAG CTTCTTTCTG CTGTCAACTG 
TAGATCCTTA TAATCAAACA GACCATCCCG 
AGTGGTCTTT CAGCCATCAC AGAATTGGAT 
TTTCGGATTT TTCTTTCTTT TGAGTTTTCT 
TGTGATATAA TATGGCTAAG TTCATTAATT 
TTTCTACTGT CTGGAAAGAG TGGCTTAAGT 
GAAGCAAATG CAATTGTCGC TGTTCCATAC 
CAAATTGGAA GAGCGTGACC TTTACTTTCA 
TCAGGCTAAT GTCTTTTATC TTCTCTTTTT 
TCTGGTCGTC TTCCTTTTTG CAGGTTGACC 
CCGTGGCGGC CCTTCTATAG TATTTGCCCA 
TCAGATACTT TCTGGAATGG CTGAGGCTAG 
TTGAAGTGGC TTGATCAGCA TATCCATGCT 
CCTACTATCC TTAAGTTACC ATTTTATTTT 
TGTTGTGACT TACTGGATTG AGCTCGATAC 
GAGCTCCTCT TCTTGGTTCT GTTGAGGCAA 
GTAACGTTTG GCCTTCCTGT TTCTGAGGTG 
TTTAAGTAGT TGATATCAAC CAGGTCTTAT 
TGAAAGTATT ACTTTTGTTA ATTGAACTGC 
TAGATCTTGA AGTGCTAGTT ATCAAAGAAC 
GTCAGCGGCC TTAGCTAATA CAACCAAACC 
TTCAGATTAT TATGGTAGAC TTTAAGTTGA 
CTTTTTATTT TAATAGGCTA TGATTTGTTT 
TGACATGCGC TTCTCATGTT TTTTGTTGGC 
GGTTGTTGTC CAATTCTTTT GCGTCGTCAT 
AAGAATTGCA AGGGTGATAA CACATTCTGG 
TGCAAAGAAA GATAAGCGCG TATACCACTG 
CAAAATATTC TGGCTGGCCG ACAAATATTA 
ACTAGCGGTT AGACTCTGTA TATGCAACTG 
CCAAGAATGT TCACTCTCAT ATTTCGTTCC 
TACAGAAACA GCTCTAGTCA ACATGACCAG 
CCCTTTTGTC TTTCACAGCC CGTGAACTAG 
GCAATAGAAG ACTATGACCC AGATAGCAAG 
GAAGTACGTA CCTTTCTTTG TGATAAGAAA 
TTGCTGGCTT CTTGTACGTC AAATTGTTTT 
TGTTCATATG CTTTGTCTTT CTTACTATAA 
CCTTATTATT GATTATCAGT TCTCTCCTTA 
GTTTACAGTT ATGAATGCAA AAGGGGGTAT 
ATTCTCTAGT TTGTTTTGAC TAATAGCGTC 
TCTTTGTGAA TTATATATAA CATGCTAACT 
TGATGACCCT GTTTTTAATC CTCTGACTCC 
AAAATGTATT TTGCATATAT GGTGCTCATC 
ATTCTCAATA TCACATTATG CGTTGACTTT 
GTTTGCAATA TCTTTTTGAA TTATGATTTA 
GCTATTAAGC GTTAAAGGTA CTAAATGTAT 
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TGATGAAGAG 


GAATATCAAT 
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TTAACATTGA 


AATTCCTTCC 


? n n n 


TAACACTAAC 


AAAAGTTTCA 


/IUjU 


TTTGATGTGT 


ATCCATCAGT 


9 1 n n 


CATGGAATGT 
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*)1 cn 
JU 


C AG ATGGG AC 


TCTTTTCAAA 


^* u u 


AGGATGTTAC 


ACCAGTTAAA 
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Z Z D U 


TATTGCTCAT 


CGATCATCAC 


o^nn 


GTTTAAATCT 


CTATATCAAT 


2350 


GAAACAAGTA. TAATCAGAAA 


2400 


TATTATGGAA 


TGTCTTTTTC 


2450 


T TT AG T TG AT 


TGATTCTCTC 


2500 


AATTTTGTTT 


TTCTAGCAAA 


2550 


ATACTTTTCA 


GGTTGTATCA 


2600 


TTGGGAGAGA 


CCACCTATAA 


2650 


TAAAGACAGA 


GGTATGATGC 


2700 


GTTATTATAT 


TCCCCATTTG 


2750 


TCTTCTCCCT 


TGCATCTTAT 


2800 


GAAGCTGTCT 


GTCATAGGTT 


2850 
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ggttattact" ttgccccaag tggc aaac ct~tatcctgat a AT.TGGATCAT 291K)— - 

CACGGATATC ATTTATGAAA CTGAAGGTTC CCTCGTGTCA AGGTAATTTT 2 950 

CCGCAATGGC AGAAGTAAAA CAGGAAGGCA AAGTCTTCTG TATCAGTCTA 3 000 

GTGGCATGTT ATCTCAGTTG C AT AAG C AAA TTATTAAACA ACTAAAATTT 3 050 

AAGTACTTTT TTATCATTCC TTTTGAGCTT AGTGGATGAT CAGTGGCTTA 3100 

AAGTGGGAAG AGGTGTTGCA TGAAACATGA CACTTGTATC AAAGATAACT 3150 

AGCAAAACAA AACTAACCCA TTTCTGAATT TCATATTATT AGGAGTAGTC 32 00 

GTGCTTTTAA AAAATTTGTT TTAAGAAACC GAAAAACTAG TTCATATCTT 325 0 

GATTGTGCAA TATCTGCAGG TCTGGAACTG TGGTTGATGG GAACGCTGGA 3300 

CCTATAACTG GGGATGAGAC GGTAAGCTCA GAAGTTGGTT TTGAAATTAT 33 5 0 

CTTCTTGCAA ACTACTGAAG ACTAAGATAA TACTTGCTTC TGGAACACTG 3400 

CTTGCTATGT TCTCTAGTAC ACTGCAATAT TGACTCTCCG CTACTTTTAT 3 45 0 

TGATTATGAA ATTGATCTCT TATAGGTACC CTATCATTCA CTCTCTTGGT 3 50 0 

GCAAGAATTG GCTCGGACCT AAAGTTAACA TAACAATGGC TCCCCAGGTA 355 0 

CTCTTTTTTA GTTCCTCACC TTATATAGAT CAAACTTTAA GTGTACTTTT 3 60 0 

CTGGTTATGT GTTGATTTAC CTCCAATTTG TTCTTTCTAA AAATCATATA 3 650 

TCTCTGTACT CCTCAAGAAC TTGTATTAAT CTAAACGAGA TTCTCATTGG 37 00 

GAAAATAAAA CAACAGCCAG AACACGATGG AAGCGACGTA CATGTGGAAC 3750 

TAAATGTTGA TCATGAGCAT GGGT C AG AC A TCATAGCTAA CATGACAAAA 3 800 

GCACCAAGGG TTAAGTACAT AACCTTTTAT GAAGACTCTG AGAGCATTCC 3 850 

GGGGAAGAGA ACCGCAGTCT GGGAGCTTGA TAAAAGTGGG TATTAA 3 896 
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<210>" 12" — " 
<211> 709 
<212> cDNA 

<213> Lycopersicon esculentum 



<400> 12 

CTGGGGCCAA AAGTGAACAT 
TCAGATGTAC AAGTGCATCT 
CATTCCCAAT ATGACAAAGT 
AGGATTCTGA AAGTTTTC C A 
AAAGCAAATC ACAGGAACAT 
GTGGCTTGAG ATGTGGCATG 
T T AC AAAAGG TGGTGTCTGA 
GTTTGTATTG ACATTGTAAG 
TGAGGGATGA GGACTGCTAT 
GCTGAACATT GTGAATACAG 
TATTCTCTTT TTGTGTATTT 
GATATGTATT CGGGGATGTT 
TCTACATCTC ACATCCTGTC 
TTGGCGGAAC AACAAGTTTG 

TTCAGAGAG 



AACAAGGACA C C AC AGTC AG 
AAATATAGAG CATCAACATG 
TACCTACAAT GAAGTACATA 
GGGACAAGAA CAGCAGTTTG 
TGTCAGATCT CCAGCTTTGA 
ATATTCATCC TGATAAAAAG 
TCCTCACTAT TTTCTTCTAT 
TATTGCAACA AAAAGCAAAG 
TGGGATTACG GGAAAGCTCG 
GTTAGAATAT TCAAATTATA 
AGGCCACCTT TCCCCGGTCA 
CACCTGGGAC AGAGTTGCAG 
ACACTATGTG TGATATTTAA 
CACAAACATT TGAAGAAGAA 

709 



ti s-\ /*■■» *k rp/— i 71 rr\f> rp 

AGCA j. QjA J. <j i 


0 U 


GTGAACjA I A 1 


1UU 


ACCTATTATG 


lb u 


GGAGCTTGAT 


200 


TGCGGGAGCT 


250 


TCCAAGTTTG 


300 


AAATGTTTGA 


350 


CGTGGGCCTC 


400 


ATGTGCATGG 


450 


TTTTGCAAAA 


500 


CAACGATGCA 


550 


ATTGAAGAGT 


600 


GAAACTTTGT 


650 


AGCGAAATGA 


700 
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<210> 13 
<211> 623 
<212> PRT 

<213> Schizosaccharomyces pombe 



<400> 13 



MASSKKSKTHKKKKETVKSPID 0 
FFF AVGDDNAVFD P ATLDKFGNMLG S SDLFDD IKGYLS YNVFKDAPFTTDKPSQS PSGNEVQVGLDMYNE1 4 0 
GYRSDHPVIMVPGVI SSGLESWSFNNCS I PYFRKRLWGSWSMLKAMFLDKQCVJLEHLMLDKKTGLDPKGI2 1 0 
KLRAAQGFEAADFFITGYWIWSKVIENLAAIGYEPNNML^ 8 0 

IWKKKVVT.ISHSMGSQVTYYFFKWVEAEGYGNGGPTWVOT 

TGIVITLNILEKFFSRSERAMMVRTMGGVSSMLPKGGDVAPDDLNQTNFSNGAIIRYREDIDKDHDEFDI420 

DDALQFLKNVTDDDFKVMIJUa^ 9 0 

ERGYYYTNNPEGQPVIDSSVNDGTKVENGIVMDDGDGTLPILALGLVCNKVWQTK^ 6 0 

HEPAAFDLRGGPRS AEKVDILGHSELNEI ILKVS SGHGDSVPNRYI SDIQEI INEXNLDKPRN 623 
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<210> 14 
<211> 432 . 
<212> PRT 

<213> Arabidopsls thaliana 
<400> 14 

MKK I S SK Y S WI A I L VVVTMT SMC Q AVG S NW 

SGGWFRLWFDAAVXLSPFTRCFSDRMMLYYDPDLDDYQNAPGVQTRVPHFGSTKSLLYLDPRLRDATSYM1 4 0 
SHLVKALEKKCGYVNDQTILGAPYDFRYGLAASGHPSRVASQFLQDLKQLVZKTSSENEGKPVILLSHSL210 
GGLFVLHFLNRTTPSWRRKYIKHFVALAAPWGGTISQMKTFASGNTLGVPLWPLLVRRHQRTSESNQWL2 8 0 
LPSTKWHDRTKPLVVTPQVNYTAYEMDRFFADIG 5 0 

TPEVLMYGKGGFDKQPEIKYGDGDGTVNLASLAALKVDSLNTVEIDGVSHTSILKDEIALKEIMKQISII 
NYE LANVN AVNE 432 
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<210> 15 

<2I1> 552 

<212> PRT 

<213> Arabidopsls chaliana 



<400> 15 

MGANSKSVTASFTVIAVFFLICGGRTAVEDETEFHGDYSKLSGIIIPGFASTQLRAWSILDCPYTPLDFN70 
PLDLVWLDTTKiLSAVNCWFKCM^ 

VEFG I EANAI VAVP YDWRL S PTKLEERDL YFKKLKLTFETALKLRGGP S IVF AHSMGNNVFRYFLEWLRL2 1 0 
EIAPKHYLKWLDQKI^YFAVGAPLLGSVEAIKSTLSGVTFGLPVSEGTARLLSNSFASSLWLM?FSKNC280 
KGDNTFWTKFSGGAAKKDKRVYHCDEEEYQSKYSGWPTNIIOT 

RMLHQLKKYVPFFVIRNIAHRSSLAGFLLYHDDPVFNPLTPWERPPIKNVFCIYGAHLKTEVGYYFAPSG4 
KPYPDNWIITDIIYETEGSLVSRSGTVVDGNAGPITGDET 

KQQPEKDGSDVHVELNVDHEHGSDIIANMTKAPRVKYITFYEDSESIPGKRTAVWELDKSGY 552 
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10 



15 



<170> Fater.tln Ver . 2.0 



<210> 10u 
<211> 661 
20 <212> ?RT 

<213> Sacc'naromyces csrevisiae 



Me^C-iy Thr Leu Phe Arg Arg Asa Val Gin Asn Gin Lys Ser Asp Ser 
Asp C-lu Asn Asn Lys C-ly C-ly Ser Val His Asn Lys Arg Glu Ser Arg 



20 



35 



Asr. His lie His His Gin Gin Gly Leu Gly His Lys Arg Arg Arg Gly 

35 *° 43 

lie Ser C-iv Ser Ala Lys Arg Asn Glu Arg Gly Lys Asp Phe Asp Arc 

50 " 55 °° 

Lvs Arg aso Gly Asn Gly Arg Lys Arg Trp Arg Asp Ser Arg Arg Leu 

55 70 '= 

lie Phe lie Leu Gly Ala Phe Leu Gly Val Leu Leu Pro Phe Ser Phe 
40 85 s ~ 

C-iy Ala Tyr His Val His Asn Ser Asp Ser Asp Leu Phe Asp Asn Phe 



100 



105 



45 Val Asn Phe Asc Ser Leu Lys Val Tyr Leu Asp Asp Tr? Lys Asp Val 
115 " ^20 ^ 25 

Leu Pro Gin Gly He Ser Ser Phe He Asp Asp II. Gin Ala Gly Asn 



50 



130 135- 



Tvr Ser Thr Ser Ser Leu Asp Asp Leu Ser Glu Asn Phe Ala Val Gly 
145 150 -== 



Lvs Gin Leu Leu Arg Asp Tyr Asn lie Glu Ala Lys His Pro Val Val 
55 ' 155 - lu 

He: Val Pre Gly Val lie Ser Thr Gly He Glu Ser Trp Gly Val lie 
180 133 
60 Gly ASP Asp Glu Cys Asp Ser Ser Ala His Phe Arg Lys Arg Leu T=p 
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OXy Ser n. Tyr Mec L» « ** »~ ^ ^ ^ ™ 

5 ^ H iS V*X « L- « « OX« - gV - "P « « S5 

^ T*r L» « *U »U «V «- «X» Ser TS. » ^ g. XXe 

245 

10 xxa OXy Tyx Trp XX. «P *- vaX «- QXn - - $ ^ "° 

OXv Tyr olu To xsr. w. Tfcr la Tyr Jg « "» 

15 " 275 ZBU 

A _ ^_ Thx Lys Leu Lys 

>■ _ . . rl ,, b.t-c Arg Aso Arg Tyr yne i-iw. 
A Is. Tyr Leu Asp Leu Glu a-| — * - 300 



290 ^ D 



20 ax, «, xi. ox, - g. H is am «-u ser OXy «u xys - cys lj» 

XL. OXy Hi. Ser « C-1V Ser «» XI. Xle ». Tyr «- - f*. 

25 v*X OXu XX. OX» 1" Pro - Tyr f y « 01, GX» « -Jjr » V- 

340 J 

, -. iv - n v ^hr Leu Leu Gly 
A« Glu His lie A.p Ser Phe Xle Asn Ala U. G.y ^ 
30 355 350 

XXa Pro ,y= V,. Pro xi; « XXe Ser OXy «« Mec „. «P «r 

35 XX, 1x1 Xe, «. « t. xxl «.= Tyr OXy J. «. «■ ~ «" SS 

■I xx. oxu „ v»x ly°s «« «» «» *j ™ sly Gly n " SI " r 



40 



3S5 390 
n Us Glu Arg 

405 

^ n,. vai - tto Glv Asp Met Lys Ser Ser 
M=r L eu Pro Lys Gly Glu Glu Val -=P - 430 

420 



50 



»an Thr Aso Tiir Tyr Gly Asn Phe He 
Ser Glu As? Ala Leu Asr. As., ftsn A5p 445 

, A _„ s- *ao Ala Phe Asn Lys An L-u Thr Met 

Arg Phe Glu Arg Asn Th_ b-- 460 

4=0 

- t„. ti 2 =er Pro Glu Trp Leu Gin 

Lvs Aso Ala He Asr, Met Tor Leu Se 480 

4 ;; * 470 

. =o- Phe Gly Tv-r Ser Lys Asa Glu Glu 

ir: Val His Giu «ir. -v- ----- ^ - 49 = 

' " 485 

. _ ... _ m __ c ar Asr- Pro Met 

Glu Leu Arg Lys Asr. - s - _ =10 

500 

Met Lys He Tyr Cys He Tyr 



Glu Val Pro Leu Pro Glu Ala Pro :- 
60 515 
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5 545 



Glv Val Asn Asn Pro Thr Glu Arg Ala Tyr Val Tyr Lys Glu Glu Asp 

" 530 535 
A» Ser Ser Ala Leu A-n Leu Thr He Asp Tyr Glu Ser Lys Gin Pro 
- 550 333 



~i *„ riv vai ^ro Leu Val Ala His Ser 

Val Pile Leu Thr Glu Gly Asp Gly Til. ya .ro - 5?5 

565 370 



10 Met Cys His Lys Trp Ala Gin Gly Ala Ser Pro Tyr Asn Pro Ala Gly 
580 583 
He Asn Val Thr He Val Glu Met Lys His Gin Pro Asp Arg Phe Asp 

15 il. Arg Gly Gly Ala Lys Ser Ala Glu His Val Asp He Leu Gly Ser 

610 615 
Ala Glu Leu Asn Asp Tyr He Leu Lys He Ala Ser Gly Asn Gly Asp 
20 525 630 5- 

Leu Val Glu Pro Arg Gin I*u Ser Asn Leu Ser Gin Trp Val Ser Gin 



545 650 



25 Met. Pro Pine Pre Met: 
560 



<210> 2Cv 
30 <211> 387 
<212> PRT 

<213> Arabidoosis thal^ana 



<400> 2 _ n , i C-Tv Asn Gly Gly 

35 Val Glv Ser Asn Val Tyr Pro Leu He Leu V & 1 -o a- 

1 " 5 10 

to Gin Leu Glu Val Arg Leu Asp Arg Glu Tyr Lys Pro Ser Ser Val 

40 Trp Cys Ser Ser Trp Leu Tyr Pro He His Lys Lys Ser Gly Gly Trp 

Phe Arg Leu Trp Phe Asp Ala Ala Val Leu Leu Ser Pro Phe Thr Arg 
45 50 5 = 

M __ mv_ tv- ?.sr> ?ro Asp Leu Asp Asp 

Cvs Phe Ser Asp Arc Ha. Ma>. «eu Tyr ^y- --- so 

55 70 7 - 

50 Tyr Gin Asn Ala Pro Gly Val Gin Thr Arg Val Pro His Phe Gly Ser 

35 ' J 



Thr Lvs Ser Leu Leu Tyr Leu Asp Pro Arg Leu Arg Asp Ala Thr Ser 

100 

^r He; Glu :-:is Leu Val Lys Ala Leu Glu Lys Lys Cys Gly Tyr Val 
115 120 



Asn As- Gin Thr He Leu Gly Ala Pro Tyr Asp Phe Arg Tyr Gly Leu 
60 130 
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Ala Ala Ser Gly His Pro Ser Arg Val Ala Ser Gin Phe Leu C-ln Asp 
145 150 1=»= 

Leu Lvs Gin Leu Val Glu Lys Thr Ser Ser Glu Asn Glu Gly Lys Pro 
5 " 165 170 175 

Val He Leu Leu Ser Kis Ser Leu Gly Gly Leu Phe Val Leu His Phe 
180 IBS ^0 

10 Leu Asn Arc Thr Thr Pro Ser Trp Arg Arg Lys Tyr lie Lys His Phe 
19 § 200 20 = 

Val Ala Leu Ala Ala Pro Trp Gly Gly Thr He Ser Gin Met Lys Thr 
210 215 220 

15 Phe Ala Ser Gly Asn Thr Leu Gly Val Pro Leu Val Asn Pro Leu Leu 
225 230 23s 

Val Arc Arc Kis Gin Arc Thr Ser Glu Ser Asn Gin Trp Leu Leu Pro 
20 245 250 

Ser Thr Lys Val Phe His Asp Arg Thr Lys Pro Leu Val Val Thr Pro 



260 



25 Gin Val Asn Tyr Thr Ala Tyr Glu Me, Asp Arg Phe Phe Ala Asp He . 
275 280 285 

C-lv Phe Ser Gin Glv Val Val Pro Tyr Lys Thr Arg Val Leu Pro Leu 
' 290 295 300 

30 Thr Glu Glu Leu Met Thr Pro Gly Val Pro Val Thr Cys He Tyr Gly 
305 310 31a 

Arc Glv Val Asn Thr Pro Glu Val Leu Met Tyr Gly Lys Gly Gly Phe 
35 " " 325 330 

Asn Lys Gin Pro Glu He Lys Tyr Gly Asp Gly Asp Gly Thr Val Asn 
340 34= 

•40 Leu Ala Ser Leu Ala Ala Leu Lys Val Asp Ser Leu Asn Thr Val Glu 
355 350 35= 

-i= Asa Giv Val Ser His Thr Ser He Leu Lys Asp Glu He Ala Leu 
370 " 375 380 

45 

Lvs Glu He 
385 

50 <210> 3 0^ 
<2ii> 33S 
<212> ??.T 

<213> Arabidopsis thaliana 
55 ll^lyl Lys Glu Gly Leu Lys Ala Lys His Pro Val Val Phe lie Pro 

Glv He VslI Thr Gly Gly leu Glu Leu Trp Glu Gly Lys Glr Cys Ala 

60 2 0 2 => 
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r „, , civ Glv I'h.r Phe Leu Cys Trp 

Aso Glv Leu Phe Arg Lys Arg Lau Trp exy L,xy 

35 40 4w 

Vai Glu His Met Ser Leu Asp Asn Glu Thr Gly Leu As? Pro Ala Giy 
5 50 55 

lie Arg Val Arg Ala Vai Ser Gly Leu Val Ala Ala Asp Tyr Phe Ala 
65 70 
10 Pro Gly Tyr Phe Val Trp Ala Val Leu lie Ala Asn Leu Ala His He 

Gly Tvr Glu Glu Lys Asn Me, Tyr Me= Ala Ala Tyr Asp Trp Arg Leu 
100 105 

lD Ser Phe Gin Asr. Thr Glu Arg Asp Gin Thr Leu Ser Arg Me, Lys Ser 
.115 120 125 

Asn He Glu Leu Me, Val Ser Thr Asn Gly Gly Lys Lys Ala Val He 
20 130 135 140 

Val Pro His Ser Me, Gly Val Leu Tyr Phe Leu His Phe Me, Lys Trp 

145 150 l3 = 

25 Val Glu Ala Pro Ala Pro Leu Gly Gly Gly Giy Gly Pro Asp Trp Cys 



Ala Lys Tyr lie Lys Ala Val Me, Asr. He Giy Gly Pro Phe Leu Gly 



30 



180 I 35 



Val Pro Lvs Ala Val Ala Giy Leu Phe Ser Ala C-lu Ala Lys Asp Me, 

200 205 



195 



Arg Me, Thr Arg Thr Trp Asp Ser Thr Me, Ser Me, Leu Pro Lys Gly 
35 210 215 220 

Glv as,- Thr He Trp Giy Gly Leu Asp Trp Ser Pro Glu Leu Pro Asr. 
225 " 230 23 = 

40 Ala Pro Glu Me, Glu He Tyr Ser Leu Tyr Gly Val Gly He Pro Thr 

245 2=0 -5= 



Glu Arc Ala Tyr Val Tyr Lys Leu Asn Gin Ser Pro Asp Ser Cys He 

265 ' u 



45 



250 



Pro Phe Gin He Phe Thr Ser Ala His Glu Glu Asp Glu Asp Ser Cys 
275 230 25= 

Leu Lvs Ala Glv Val Tvr Asn Val Asp Giy Asp Glu Thr Val Pro Vai 
50 290 29= j0 ° 

Leu Ser Ala Giy Tyr Me, Cys Ala Lys Ala Trp Arg Giy Lys Thr Arg 
305 310 " D 

55 ' Phe Asn Pre Ser Gly He Lys Thr Tyr He Arg Glu Tyr Asn His Ser 

22= - 3 - 



60 



, -„„ t— r-i- '--•v Glv Thr Gin Ser Giy Ala His 

Pre Pre aj.s. Asr. jjeu L=- Q~- — -•' — * — - - -q 

340 3 ' 3 

Vai Asp He Me, Giy Asn Phe Ala Leu He Glu Asp lie Me-- Arg Val 
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355 



360 



365 



-i« riv v^s Aso Gin Val His Ser 
Ala Ala Gly Gly Aan Gly Ser Asp xle Gly K-s A|P 



370 375 

5 

Gly He Pits Glu Trp 
385 



10 <210> 4(X 
<211> 1986 

<212> DNA . . 

<213> Saccaaroirryces cersvisiae 

15 <220> 

<221> CDS 

<222> (1) . . (1983) 



<400> 4 aac c = a aag agt gat tct 48 

20 ate ggc aca ccg t« cga aga a-w cag Ser Ser 

Met Gly Tiir Leu PHe Arg- Arg Asr. v^_ c-n ^ 15 
1 ^ 

aat gaa aac aat aaa ggg ggt tec get cat aac aag cga gag age ^ 
25 asp Glu Asr. Asr. Lys Gly Gly Ser Val Hi* ash 3Q 

s sr. s s ss ss ss ss s is ss ss s k s is 

30 35 40 

att agt: gac agt ffea aaa aga aac gag egt ggc aaa gat ttc gac agg 

lie Ser Gly Ser Ala Lys Arg Asr. Glu A_g Giy ^ 
SO 55 

_ a»t tec aaa aga erg 

aaa aga gac ggg aac ggt aga aaa eg. ^ jgj ^ ^ g Leu 

Lvs Arc Ks-3 Gly Asa Gly Arg Lys —Sf -=P 80 
'-- 70 

= ac - C t£ ctt ttg ccg tec age ttt 

att ttc att ctt ggt gca wtc _._a gg- ?h Sar one 

Tie PHe lie Leu Gly Ala Ptie Leu Giy Val Leu Le_ .ro ^ 

85 su 
ggc get cat cat gtt cat aat age gat age gac ttg ttt gac aac ttt 
Glv Ala Tyr His Val His Asr. aer.A,? S- A^p i- ^ 
100 - 03 



*0 



96 



144 



192 



240 



288 



336 



ctc tw aat gat tgg aaa gat gtt 

gta aat ttt gat tea c t a*a gtg ^ t ^ Trp Lys ^ V al 

Val Asr. Pr.e Asp Ser Leu Lys vai ^ • - 12 5 



50 113 i2 ° 



„ a i-i- cag ect ggt aac 

etc eea eaa ggz ata agt teg ttt a - g~ ^ ^ Gly , sn 

Leu Pro dr. Gly lie Se- s=- — 14Q 



130 i35 



. p-r- C a a aat ttt gee gtt ggt 

tec aca zee ttt tea gac gat c . ag : *a- aa ^ Val Qiy 

s ^ T .-„ Ser S er Leu Asp Asp i-eu s~ t-- r 16Q 

150 - ==> 



3 84 



i32' 



480 



60 



a „ a-a cat ect gtt gta 528 

aaa caa etc tta cgt gat tat aat a„ g-j, ^ „. s pro V£l Val 

Lys Glr. Leu Leu Are A»P =»— 
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165 



170 175 



atg gee ccc ggt gtc act ccc acg gga act gaa age egg gga get ate 
Met Val Pro Gly Val He Ser T*r Gly He Glu Se. Trp Gly 
180 - Bz> 



576 



624 



f--*- rr^^ caf etc cert aaa egg ccg egg 

gga gac gac gag tgc gat ag. tw- g-r ca- twu u 

Gly Aso Asp Glu Cys Asp Ser Ser Ala His Pne Arg Lys jug -e- W 
" i95 200 

10 _ _ rrrr r crt tcra 672 



gga agt ttt acg ccg aga - atg gjt atg gac aaa get cgt tgg 

Gly Ser Phe Tyr Mec beu Arg Th^ Met VaJ. cet . 
210 215 220 



S £1 S: SS Si? S SK - Si S liy S IK So «s « 

225 230 

IS S S S K E SIS IS - IK IS ?£ ffi S= g. - 



720 



tec ate 768 



245 



K Si? £ SS S IS si is ss IS SS L" s si; ss Si 

^<;n 26r> 



815 



864 



" sr, ^ ss S s s s; is is a s £ s s s s 

■ * 275 280 
30 aC g aag cca aag 912 



gca tat tta gar eta gaa aga cgc gac agg cac «• 
Ala Tyr Leu Asp Leu Glu Arc Arg As? ?-g -V- -«« 
290 2 95 300 



35 sx is a; ss is s is ss Sssses i 

305 310 313 



950 



«o SES£SESSaSS£SS2|iiS 1005 

325 

, s IS S! Si S£ IS SS SS £ § IS IS S5 S § S 

is it: ss s k is - si £ r. s is i s £2 is 

30 c «-« a— ast cert gaa ace aaa gat ace 1152 

m is si ss ?a is & s; »r «. «t *«. »* 

370 375 3S0 

55 „ e » f =: 5 « 2£ IT; SS IS "5 IS IS IS 1200 

He Glr. Leu Asn Tnr ueu Ala -v. v~y -»= r - 400 

3S5 390 

. „ = „ a 9 . — fjc acre g" aca cca cca 1245 

aga acc gag aga gca aa=, = u = ~ : £i ^ ?ro Ser 

60 Arg Zle Glu Arg Val i,ys Me- ^e- ^— ^- - 

405 



1056 



1104 
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«. c~ «a „ „a g. gj «*= a« g «J « I°i 12 

Met Leu Pro Lys Gly Glu Glu Vai -ie i=p 430 
420 423 
5 tea gag gat gca ttg aat aa= aac act gac aca tac ggc aat ttc att 
Ser Glu Asp Ala Leu Asn Asa Asn Thr Asp Th- Tyr u^y 

435 

10 cga ttt gaa agg'aat acg age gat get ttc aac aaa aat ttg aca atg 
Are Phe Glu Arg Asn Thr Ser Asp A±a — «■ ^ 

450 433 
aaa gac gec att aac atg aca tta ccg .« tea cct gaa tgg etc caa 
Lvs Aro Ala lie Asn Met Thr Leu Ser II- &e 480 
465 " 4 ™ 475 

*~at tec aag aat eras, craa 
aga aga gta cat gag cag tac teg ttc gg w ^ ^ ^ - Glu 



1296 



13 92 



1440 



1488 



aga aga gta cat gag cag tac teg t^.c g w " lu Glu 

Arg Arg Vai His Glu Gin Tyr Ser Phe Gly Tyr Se- Ly ^ 

gag tta aga aaa aat gag eta cac cac aaa cac tgg teg aat cca atg 1536 
Glu Leu Arg Lys Asn Glu Leu his .*a.s Lys His ^ 
500 • 503 



25 s ^ ^ c=a « ==« g. « J? SS SS 

Glu Vai Pro Leu Pro Glu .Ua Pro riis kl-u ivy ^ 

515 520 

^ eta ta* aaa gaa gag gat 
30 ccg gtg aac aac cca act gaa agg gca tau ^ - ^ q1u ^ 

Glv Vai Asn Asn Pro Thr Glu Arg Al« Tyr vai ry^ 

530 535 
gac tec tct get ceg aat ttg acc ate gac tac gaa age aag caa cct 
35 ass Ser Ser Ala Leu Asn Leu Thr ile Asp Tyr _ 60 
545 550 35 = 

„ c rc ate gcg cat tea 

g-a ttc etc acc gag ggg gac gga ac- gtj - ^ ger 

Vai *he Leu Tiir Glu Gly Asp Gly Thr Vai P-o u-u ^ 
40 555 D7D 

tea cca tac aac cct gec gga 
atg tgt cac aaa tgg gec cag gg gc uca ccg ^ ^ Qly 

Met Cvs His Lys Trp Ala Gin Gly Ala se o y sgQ 

580 5a=> 

45 _ rr & aat cga ttt gat 1824 



1632 



1680 



172S 



1776 



ssss ?i? a si s: - s; £ s 25 - - 

etc ggc age 

Ala Glu His Vai As? 1^ 

615 

... it: cca acre ggt aat ggc gat 

reg gag ttg aac ga^ tac - Gl * sn G iy Asp 

55 Ala Glu Leu Asr. Asp Tyr ne x-*u Lys — 640 
525 ^30 

„ r— — a c" ca r tgg gtt tct cag 
zzz gtc gag cca cgc caa ttg tct aa. --- » x Ser Gla 

Leu Vai Glu Pre Arg Glr. Leu Ser Asn Leu S_. C— . 
60 S45 «° 



lie ^ — ^ „ „ 

CC ; 600 



so .„ „» S ; |f; ~; «s 'K S IS !5 

lie Arg Giy Gly Ala Lys Ser Ala Glu .-s vs.- ^ 
610 61 - 



1872 



1920 



1968 
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1986 

atg ccc ttc cca atg taa 
Met: Pro Pb.e Pro Mec 
660 



10 



<210> 5 ou 
<211> 661 

<212> ?RT . . 

<213> Sac char amy ces csravisiae 



Meekly Thr Leu Phe Arg Arg Asn Val Gin Asn Gin Lys Ser Asp Ser 
1 5 -° 

15 Aso Glu to Asn Lys Gly Gly Ser Val His Asn Lys Arg Glu Ser Arg 

20 23 

Asn His He His His Gin Gin Gly Leu Gly His Lys Arg Arg Arg Gly 

40 * = 



20 



35 



He Ser Glv Ser Ala Lys Arg Asn Glu Arc Gly Lys Asp Phe Asp Arg 
50 " 55 50 



Lvs Arg Asp Gly Asn Gly Arg Lys Arg Trp Arg Asp Ser Arg Arg 

65 70 ' 5 

lie Phe He Leu Gly Ala Phe Leu Gly Val Leu Leu Pro Phe Ser Phe 



85 



30 Glv Ala Tvr His Val His Asn Ser Asp Ser As? Leu Phe Asp Asn Phe 
100 105 
Val Asn Phe Aso Ser Leu Lys Val Tyr Leu Asp Asp Trp Lys Asp Val 
115 120 X2d 

35 Leu Pre «n Gly He Ser Ser Phe Zie Asp Asp He Gin Ala Gly Asn 
130 135 140 

Tvr Ser Thr Ser Ser Leu Asp Asp Leu Ser C-iu As* Phe Ala Val Gly 
40 143 150 15 = 

Lys Gin Leu Leu Arg Asp Tyr Asn He Glu Ala Lys His Pro Val Val 



165 



45 



50 



Mec Val Pro Gly Val He Ser Thr Sly He Glu Ser Trp Gly Val He 
1B0 135 

Glv Asn As= Glu Cys Asp Ser Ser Ala His Phe Arg Lys Arg Leu Trp 
" 19 " 5 200 20j 

Glv Ser Phe Tvr Met -Leu Arg Thr Mec Val Mec Asp Lys Val Cys Trp 
' 210 215 "20 

Leu Lvs His Val Mec Leu Asp Pro Glu Thr Gly Leu Asp Pro Pre Asn 
55 225 " 230 

Phe Thr Leu Arg Ala Ala Gin Gly Phe Glu Ser Thr Asp Tyr Phe He 
245 ~=° 

60 Ala Gly Tyr Trp He Trp Asn Lys val Phe Gin Asn Leu Gly Val He 
250 -== 
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Glv Tyr Glu Pro Asn Lys Me- Thr Ser Ala Ala Tyr Asp Trp Arg Leu 
275 280 235 

Ala Tyr Leu Asn Leu Glu Arg Arg Asp Arg Tyr Phe Thr Lys Leu Lys 
290 " 295 300 

Glu Gin lie Glu Leu Phe Kis Gin Leu Ser Gly Glu Lys Val Cys Leu 
305 310 315 320 

He Glv His Ser Met Glv Ser Gin He He Phe Tyr Phe Met Lys Trp 
325 " 330 335 

Val Glu *la G 1 u Glv Pro Leu Tyr Gly Asn Gly Gly Arg Gly Trp Val 
340 " 345 350 

Asn Glu Fis He Aso Ser Phe He Asn Ala Ala Gly Thr Leu Leu Gly 
355 " 360 365 

Ala Pro Lvs Ala Val Pro Ala Leu He Ser Gly Glu Met Lys Asp Thr 
370 " 375 380 

He Gin Leu Asn Thr Leu Ala Met Tyr Gly Leu Glu Lys Phe Phe Ser 
335 390 395 400 

Arg He Arg Val Lvs Met Leu Gin Thr Trp Gly Gly He Pro Ser 

405 " 410 415 

Met Leu P-o Lvs Gly Glu Glu Val He Trp Gly Asp Met Lys Ser Ser 
420 425 430 

Ser Glu Asn Ala Leu Asn Asn Asn Thr As? Thr Tyr Gly Asn Phe He 
435 440 445 

Arc Phe Glu Arg Asn Thr Ser Asp Ala Phe Asn Lys Asn Leu Thr Met 
450 455 460 

Lvs asd He Asn Met Thr Leu Ser He Ser Pro Glu Trp Leu Gin 

465 " 470 475 480 

Ara Arc Val s C u Gin Tyr Ser Phe Gly Tyr Ser Lys Asn Glu Glu 
435 490 495 

Glu Leu Arc Lvs Asn Glu Leu Kis His Lys His Trp Ser Asn Pro Met 
" 500 505 510 

G* ■ ■ v»"> *-o Le^ *ro Glu Ala Pro His Met Lys He Tyr Cys He Tyr 
515 . 520 525 

Glv Val Asr Asn Pro Thr Glu Arj Ala Tyr Val Tyr Lys Glu Glu Asp 
=30 525 540 

Asr; 3e- Al= "^u Asn Leu Thr He Asp Tyr Glu Ser- Lys Gin Pro 

¥*5 " ~ =5Q =55 550 

Va- oh, t.o„ -^r g'u Glv Asr; Glv Thr Val Pre Leu Val Ala His Ser 
5i5 570 575 

Met Cvs His Lvs Tm Ala Gin Gly Ala Ser Pro Tyr Asn Pro Ala Gly 
580 * 585 590 
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lie Asn Val Tiir He Val Glu Met Lys His Gin Pro Asp Arg Phe Asp 
595 600 60 = 

lie ato Glv Gly Ala Lys Ser Ala Glu Kis Val Asp He Leu Gly Ser 
610 * 615 ° 20 

Ala Glu Leu Asn Asp Tyr He Leu Lys He Ala Ser Gly Asn Gly Asp 
625 S30 635 

Leu Val Glu Pro Arc Glu Leu Ser Asn Leu Ser Gin Trp Val Ser Gin 
645 S3 ° 

Met Pro Phe Pro Met 
660 
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<110> £ivmr.= Dr., Seen 
<120> 



<I30> 
<i;u> 



<210> 1 £r 

<2ii> isas 

c212v Genomic DN7\ 

-5212 > Saccharonycss ctrsvisiae 

c220> 

<222> CDS 

<222> (i) . • C1S£2) 

< ' 10C> <-t~ crs a^c cia asg sgt 5" - cc 40 

«. J- «; cln Lys .„ ^ ssr 

KftS Gly Tt-.r LCU Pne AT= — = v «. ^ 

5 10 

caa «c aaz «a S « SS* c« g« »« ^ ^ ^ " 

L 7 _ C-l, A« A.n Ly= Oly Gly .« Val Kt» Ly. Xr. Olu Se. 

20 25 

_ ( , c=:= cat aag as = X ^ 

~ si si si si si 2 st; £ s, - ~. - - - •«» 



25 40 



_ , arc £d2 eaw tC= aac a=g 

=*= ^ *= a ~ : sa ::; «: y x. v . p*-- *r S 

lis s=r Gly ssr Alt Lys ?.-= A=n Gl- A-= <^ - 



SO 

SO 



, , a =a cat ccc aga aga = -5 

a.,a aga gac ggg aac ggi aga == - ; 

Lv. ? .rg Gly *.» Gly « i.y. M *=? >-= " s0 

r ee - ==a etc gs- - 5 C= = C " !" S ;!! 

-is ?'-= Ha Leu Gly A.\a Pas le- ^ G _ 

r, 



35 



S g = ffSS « ft „ s ~= - - *~ ::: H: £ 1^ - - 

3iv >.la Tyr Kis Val Si. A-r. «« »e- As, — 



132 



240 



2sa 



225 



200 
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e*a a- «* "a etc .a, 5 -S »t «s S« =at t W gat gtt 

Sw£ * — 5 e T . v . val Tyr uu X== AS? Tr? Lys AS? Val 

Val Acn. ?hs Asp Scr Leu vai Ty. . ' 

IIS i2 ° 

cue c=*. eu g=- a — a S w - _ cin Al . Giy M 

Lcu.Pro Glr- Gly Sle S = r Scr Pw 2-e * s ? AS - 
130 

_ „„ _._ - aa sat £t- S== =- = S=t 

,a= t== a = a «c ccc «a J- - - ph£ ^ Val Gly 

Tyr Scr Thr Ser Ser Leu As? *. 1£Q 

14 S ISO 

.u c« c« e*. eg, cat a« .« ^ .« £ c« J- ;; . 

Lv, Gin Leu Leu Ar= MP Tyr Asr. rie Glu *±» "" S 
165 



3S4 



43 2 



450 



, =3 s == «, E -= - £• ::; ;!! £ Hi 

M.-. V.1 tr= Gly VH H= S«r TRr Oly «<- - ^ 



2.30 



;-g car etc est **» cs ' C52 



,.a 9 ac SaS -S= -O- ^ Ar „ lya xr£ L£U Tr? 

Giv As? Asp Glu Cy= AS? Ser Sc. 

195 200 



= S£ agi =a= at= ctg afft a— -- - 

Giv Ser ?r-e Tyr Met Leu Ar= Tar Ms- Vai K=- A. -/ 
210 2 - 5 

- ^tc crac csa ccg z-zlz: 

« a «- s « tc S «» =- -r- ,_ s « ra 

t.ou Lys Ki. V.1 Ms — AS? Pre Glu T,r Gxy Le- ~? - _^ 



=.t= 5 tt. ass sat aas gcr t=t 



S2. 



£72 



720 



aa5 

«, *=S ==a c„ g=a =ca ca S 2 g= --tc S « «a act gat cat r« acc 
r-h= T'r.r Leu Ar, Ala Ma Gin C-ly Phc Glu Ser ~P 

I:: si £ ;s r. - s ^ ^ =" - ~ % ~ - 

55= ^ .„ .« « ; - ;! - :!! ~ Z; ;!! 2; £ 

■3iv Tyr Glu ?r= Asr. i-ys Ms — 

27 5 350 ••. 



= 54 
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5N cat «a gat c = a ,3^ csc ga- *« »c «c a=S «s cu 

il* Tyr Leu Asp L=u Clu Ar= l«ro x«p *r 5 Tyr Pne Th. L, - Leu I* 



2SC - s= 



ga£ c« «c gaa ct S . t« "C =aa ccg ag- «t S£ £ ^ ^ ^ 
Slu Gin lie Glu Leu Phe His Bin Leu Ser Oly Glu L-y- *y. ^ 

303 310 

e -t t — ca= net a== tct "3 aaa 

ace s« cat tec ac S ==t c- « ^ Tffp 

lie Glv KiE Ser Met Gly S« Gin *le -1= - - 



9 12 



9 50 



^ c ww tac c=£ aa t egc ggt =5" SS= C5S SwC 

etc gag gcu gaa S3= c a " x ^ Gly tr? Val 

v*l ciu Ale Glu Gly ?r= Leu Tyr c-ly A-n Gly Gly . - 



340 341 



« g* ? «c aca B « - etc att - ,« «■ £ = ~ «J £ 

A=rv Glu Ki= =lc A=? Ser Ph= ,1* A*« ~la Al- / ^_ 



355 350 



e - a att a=t eg-. S aa ac 3 g*._ 
see cea aac g=a g-- = A Ttr 

Ala Pro Ly= Ala V- Pro Ma Leu He _e. G-y 



370 275 



—- a »ac C" ttc cca 

c« .« ™ «» « r.u :» ^ »» «- .« 

11= Gin lm Mr. Tiir Leu Ala Me. Tyr c_ ^ 
325 390 

t . — e-srt eg: a==. e=a tea 
a« W a*a S ~ -a at* «a c« ^ -* ^ ^ ^ 

Arc lie Glu Arc Val 1*3 Wee Gl*. T . ^ 



100B 



1055 



1104 



1152 



1200 



405 



. = a » =a= act egg H=S =a, atg tea est 

a=5 ks a*s SB- S a - - . M= - Lys S er Ser 

K«r Leu Pro Lys sly Glu Glu V.l ,=P Gl> ^- ^ q 



^12 0 



,=a gag ga, g=a - .ae ac, ga= aca ca= ^ - - 

Ser Glu a«p Ala Leu x.a A«n Tbr ««P T«- Ty. 



435 440 



sa - us. 2-at i-g a« acg 

c= ? . ga ? . agg aa= acg ag= g^- Asi L . = u Thr Met 

,„ c-lu ^g xsr. Thr Ser Asp ^1. ?r._ X= 



450 



- - a csi caa egg =tc caa 

«s cce-etr aa= atg ata tta teg -^^ ^ 

Lys Asp Ala He M= — ~ * - 4«0 



1296 



13 44 



13S2 



144 0 



- 7S 
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aga sga gta cat gag esc tac teg ste = = = sat «=« aae sac « £ eu 145C 
Arc Arc V e .l :-iis Glu Gi^ Tyr Ser 5hs Gly Tyr Ser Lys f.sr. Glu Giu 
40= 4S0 4 55 

gac ecu aga ana aa: gag eta cat ci: asg «: tg= teg aat cca atg 153 5 
Glu Leu Arc Lys Asp. Glu Leu His Kis Lys His Tr? Ser A=r. Pro Me: 

coo 535 • S1 ° 

c i5 cis ccs est ccs. caa cc: ccc eac ate a" acc tat t=t ata tac 15=1 
Glu val Pre Lsu Pro Glu Ala ?rc Kis Ks: Lys lie Tyr Cys lie Tyr 
£15 520 - 2S 

ccc cc S «■= ?.« act gw. age gca sat gta tat aag gaa «c cat 1S22 

Gly Val Asr. Asr. Pro Thr Glu Are XI a Tyr val Tyr Lys Glu Glu As? 
530 S3S 5 40 

gac tec set set etc aat ttg acc ate gac tac gaa acc s== eae. cc. 1GSC 
Ld Sir £er Ala Leu fcsn Leu Thr Sis As? Tyr Glu Ser Lys Gl" Pro 
545 530 5=0 

eta ttc etc acc gag ggg «c g=a acc est ccc etc gcg gcg eat tea 
val Phe Leu Thr Glu Gly Asp Gly Tiir Val Pro Leu Val Ala His Ser 
s -= 370 57S 

ate set cac aaa ccc ccc cag get get cca ccc tac aac est gee gee. 177£ 
He: cvs Ki= Lvs Tr: Ala Git Gly Ala Sar Pre Tyr Asr. Pro Ala Gly 
5=0 ££5 250 



att aac est act att gtg gaa atr aaa car cag cca gat cg= ttt gat 
lie Asn Val Tlir lie Val Glu Met Lye His Gin Pro As? Arc ?he As? 
555 600 505 

iica cgs =gc «a gca saa age gee eaa cac gta gac ate etc S gc »S= 
lie Arc Gly Gly Ala Lys Ser -Ala Glu His Val As? He Lsu Gly Ser 
£10 SIS £2 ° 

ccc cac ssg aac oat tae ate rtg aaa att gca age ggt a=C gge cat 
Ala Glu Leu ash A==. Tyr lie Leu Lys Xie Ala Ssr Gly Arr, Gly A=? 
£25 530 €S£ "° 

etc gfcc =e.c cca esc 



17 2.8 



lc'2 4 



1672 



1320 



c&s. ttc ret aat "c sgc c.c t:gg gtt tc: cc= isss 



Leu. Val Glu Pre Arc Glu Leu Ser Asr. Leu Ssr Glrv Cr? Val Ser Glr. 
£45 
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ccc tec cca aeg taa 
Pro Phs Pre Met 

550 



PCT/EP00/02701 
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<210> 2 

<211> SSI 

<212> PRT 

<213> saccharomycss cnrevisiae 



< 4 0 0 > 2 

Hat Gly Ihr LCU the Ar 5 Ar= A« V»! Gl, A« Glr. Ly= «• = 

5 10 
Clu A=, Asn Ly= Oly Gly val His A» I.y. *r 5 Clu S« Ars 

A.«. K i« Xle kIs Ki. Gin Gin Gly X." Cly His Lys Ars ^ « 
lie ».r Cly s« ,1, Ly S **e ~» =^ « C ^ " P ^ W ~ S 

Lys ArJ X= ? Gly Oly Ar S «^ « Trp Arj A, P Se= Ar* Ar S **« 

l" P"« lie Gly Ala Ph= Leu Oly Val L« Uu P» Ph* Ph. 

S3 90 

Cly ,1, Hie val Hi, ^ s.r a=p sc, A=? L.u *» *»P — ^ 

Val A=, Plu, *eu lor- Val ^ L-u W «P «p Ly = « val 

L.u Pro c" Gly XX. .« .« - XI. *op «. Oln Ala Gly As, 

Ty r Ur xnr .« Z..U - ? X-u S, Gl. X» Ph. - -1 Gly 

£ ai,. x,~ Leu „ « ^ XI, ai. ia ^ ^ *~ Val 

T1 c „_ T w- G w Xle Clu sir ?r? Gly val 11= 
Ket Val Pro Gly Val He Se- 1~ C-y x 

-i as 

120 „ «-.- e— lis Ki« Ar= Lys Ar S Leu Tr? 

Gly As ? ?.sc C-lu cys A3? Ss- -S. Ala 
_ 2. 0 0 

- 25 ^, ^ m-- >s- Lys Val cys Tr? 

Gly sar ?r. S Tyr Me"- Leu Arc Thr Met ^- ~ ^ 

210 ^ G i v L*u Aso ?rc ?rs Asr. 

Lys Xia Val Ket Leu *«P ^ ' 240 

225 , !?° c ,„ G w Clu S=r r.-.r As? Tyr Ph. He 

Phs Tiir Lr- Ale Ala Glr. G.y z5 = 

„. Tv - v»i 'he C-lr. AS^ Gly Val 21* 

Ala C-iy Tyr Trp Xle Tr? >.sr. l,y. -n 

2SC - v _ Ala Tyr Asp Arc Leu 

Gly Tyr Slu :-rc Asr. Lys Ms: r ~ _ gs 

275 , !ll a-s T-^r Phft Tl-Jr Lys Leu Lys 

Ala Tyr Leu As? Leu Clu Ar= ~r= — -- — - - ^ 

250 „, . ,. e - siv Slu Lvs Val cya L=u 

Glu Glr. Sis Clu Leu Ph. His Glr. w-, ,„ 



2C5 210 
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• e v . civ =- G^n lie lie ?hc Tyr ?he K«c Lys Trp 

life Civ Hi 8 Ser KsS Gl> 235 

2 3 0 

v... M . -u o" «. ^ »» Trp v "' 

»„ «. «. it: »p .« ». »• «» " y ™ ~ u oly 

U , .„ £ ». V.1 «. XI. »- - •« ^ ^ ^ 

375 

370 .r,... riv wu Glu Lys PhC Fhs Ser 

He Gin Leu Asn Thr Leu Ala Met Tyr Gl> L_u Gi y ^ 

365 390 T Ts r T-o Gly Gly lies Ser 

Ar= lie Glu Arg Vel Lys Me: Leu Olr. -hr T.? Y ^ 

T . Tv - " y Glu Glu v»l Xle Trp Gly Ae? M« Ly 5 ser S.r 
Kec Leu Pre Lys Giy u - u _ 4S0 

"* 20 t~ 3 *=.n Th- Tyr Gly Jtan P&* He 

= er Glu A=p Ale. Leu A.rv Asr. Asn *«r A 3 p *Y- ^ 

435 r° 1,1. -he >sr. Lys Asn Leu Thr KeS 

Arc Phe Glu Arg Asn Thr Ser Asp Ala As* ^ 

420 455 T c „- ti. Se- Pro Glu Tr? Leu Gin 

V/8 A= = Ala He Asn He: Thr Leu Sc- -1- Se. ^ 

;:: v.i Kis giu ^ .. r 5 - ; y ; - - « - «« 

Glu Leu Ar 5 Ly S £ Giu Leu Kis Hi* £ ^ ~P Ser A- Pre He* 
val Pre Leu « Giu Ala Pro £ « X*. XI. xvr cy. Xle Tyr 
Gly val M A» P« *hr Glu Arg Ala Tyr Val Tyr Ly S Glu 

53 - rl se ~ XjVs Gin Pro 

r-^ r rK r * lie AS? Tyr Giu se- <u_ 
As? Ser Ser Ala .-.sr. u— -a. - _ . = 50 

Vel Phe X.eu Thr Giu Gly *«p C-ly Tte Vel Pro v.. 

570 

565 r,-^ tv— Asn Pro Ala Gly 

,.e, cys Kir Ly S Trp Ale Gl» Gly Ala Ser Pro T>. As. ^ 

550 - - 1V1 M . e Kis Gin ?r= Asp Ar S 

lie Asr. val Thr He v ai Glu Kec x*3 

sss 600 „• -,,=,1 T le Leu Gly scr 

II. >-r 5 Gly Gly Ala ^ ser XI, Glu K» val W 

S10 61=1 ■ Gly Asr. Gly As? 

Al, Glu Leu ^ W Tyr Sle Le, Ly S xle X„ — 

625 "° T c _ a- ^ Ser Gin Tr? vel £^r Gin 

Leu Val Giu Pro Ar= Gin Leu Ser A £ » £S£ 



Ms: Pre ?iie ?rc Me: 
£50 



SUBSTITUTE SHEET (RULE 26) 



WO 00/60095 



40/53 



PCT/EROO/02701 



<21C> 3 h 

<211> 2312 

<21?. > gcn.orr.ic DMA 

<213> Schiiosacc ha rornyc £ S .pc-ic 
«400> j 

ATCGCGTCTT C C AAG AAO AG CA.ww CT CAT ^G^.-^AGA AAGA?VGTCAA SO 

ATCTCCTATC GACTTACCAA ATTCAAAGAA ACCAACTCGC GCTTTGAGTG ICO 

AGCA^CCTTC AGCGTCCGAA AGACAATCTG TTTCAAATAA ATCAAGAAAA 15 C 

TCT *AATTTG G AAAAAG ATT G AATTTT ATA TTGGGCGCTA TTTT G G G AAT 2 00 

ATCCGGTGCT TTTTTTTTCG CTGTT GGAGA CGACAATGCT GTTTTCGACC 25 0 

CTGCTACGTT AG AT AAATTT GGGAATATGC TAGGC7CTTC AGACTTGTTT 3 CO 

GATCACATTA AAGGATATTT A7CTTATAAT GTGT7TAAGG ATGCACCTTT 35 Q 

TACTACGGAC AAGCCTTCGC AGTCTCCTAG CGGAXATGX^ GTTCAAGTTG 4 0 C 

G7CTTGATAT GTACAATGAG G G AT AT CG AA GTGACCATCC TGTTATTATG 45 C 

GTTCGTGGTG- TT ^ TC ^ GC TC AGGATTAGA^ AGTTGGTCGT TT^TAATTG 5 0C 

CTCGATTCCT TACTTTAGGA AACGTCTTTG GGGTAGC7GG TCTATGCTGA 55 0 

AGGCAATGTT CCTTGACAAG CAVTGCTGGC TTGAAGATTT AATGGTT GAT £0 0 

A^AAA^CCC- GCTTGGATCC GAAGGGAA7T AAG CT G GG AG CAGCTCAGGG £5 0 

GTTTGAAGCA GCTGATTTTT TTATCACGGG CTATTGGATT TGGAGTAAAG 7 00 

TAA7TGAAAA CCTTGCTGCA ATTGGTTATG AGCC7AATAA CATGTTAAGT 75 0 

CCTTCTTACG ATTGGCGGTT ATCATATGCA AAT TT AG AGG AACGTGATAA £0 0 

AT ATTTTT C A A-,GTTAA^A TGTTCATTGA GTACAGCAAC ATT G T A C ATA =5 0 

AG AAAAAG G T AGTCTTGA7T TCTCACTCCA TGGGTTCACA GGTTACGTAC BOO 

■ TATTTTTTTA AGTGGGTTG A AL-'JTL-AGCGC TACGGAAATG GToGACCGAC D5 0 

TTGGGTTAAT GATCATATTG AAGCATTTAT AAATGTGAGT CTCGATGGTT 10 0 0 

CTTTGACTAC GTTTCTAACT TTTGAATAGA TAT C G GG AT C TTTGATTGGA 10 5 0 

CCACCCAAAA CAGTGGGAGC CCTTTTATCG GGTGAAATGA AAGATACAGG 1100 

TAT7GTAAT7 ACATTAAACA TGTT AAT ATT 7AATTT7TGC TAACCGTTTT 115 0 

A\GCTCAATT GAATCAGTTT TCGGTCTATG GGTAAGCAAT AAATTGTTGA 12 0 0 

GATrUi*r*rAC TAATTTAG7 G TTT AG TTT G G AAAAATTT TT TTGCCCTTCT - 3 =. 0 

GAGGTATATT CAAAAATACA AVTGTGCTCT ACTTTTTGTA ACTTTTAATA 13 0 0 

CAGAGCCATG ATGGTTCGCA CTATGGGAGG AGTTAGTTCT ATGCTTCCTr. 13 5 0 

AAGGAG G C G A TGTTGTATGG GGAAATGCCA ' GTTGGG7 AAG AAATATGTQC i-to C 
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TCTTAATTTT TtmAATW TTAGGCTCCA GATGATCTTA ATCAAACAAA 
TTTTTCCAAT GGTGCAA.TTA TTCGATATAG AGAAGACATT GATAAGGACC 
ACCATCAATT TGACATACAT GATGCATTAC AATTTTTAAA AAATGTTACA 
C ATC ACG ATT TTA^-GTCAT GCTAGCGAAA AATT^TTCCC ACGGTCTTGC 
TTGG ACT G AA AAAGAAGTGT TAAAAAATAA CCAA^TGCCG TCTAAATGGA 
TAA ATCCG CT AG AAGT AAG A ACATTAAAGT TACTAA^TTA TACTAACCCA 
7\J\TAGACTAG TCTTCCTTAT GCTCCTCATA TGAAAATTTA TTCCSTTCXC 
CGGGTCGGAA AACCAACTGA GAGAGGTTAT TATTATACTA ATA^CCTGA 
GG.CCAACGT GTCATTGATT CCTCCGTTAA TGATGGAACA AAAGTTGAAA 
ATGTGAGAGA ATTTATGTTT CAAACATTCT ATTAACTGTT TTATTAGGGT 
ATTGTTATGG ATGATGGTGA TGGAACTTTA C C AA.T ATT AG CCCTTGGTTT 
GGTGTGCAA.T AAAGTTTGGC AAACAAA>-AG GTTTAATCCT G CT A7VTAC AA 
GTATCACAAA TTATGAAATC AAGCATGAAC CTGCTGCCTT TGATCTGACA 
C^GCACCTC CCTCGGCAGA ACACGTCGAT ATACTTGGAC ATTCAGAGCT 
AA.VTGTATGT T C ATTTT ACC TT AC AAATTT CTATTACTAA CTCTTG^AT 
AASGA^TXA TTTTAAAAGT TTCATCAGGC CA.TGGTGACT CGGTACCAAA 
CCCTTATATA TCAGATATCC AGTACGGACA T^GTTTTGT AGATTGCXAT 
T*ACTAA=TA ACCGAACAGC GAA^TAATAA ATG.^ATAAA TCTCGATAAA 



14 50 
1S00 

15 5 0 
1SO0 
i£50 
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<2lo> 

<112> genomic DN* 
c-.13> Aribidoosis trial! 
^400* 4 



ana 



. ICCCCOT , -c™. =c o,c^=- cgacgccgcc 

.xrAG'ATTC C-CAXAAGAAA CCACACOWtt 

JVTCTSfcMSAC C-TGGTGCACG ATSAfi-ATxv. - 

„ Ae UATOAMG OMOWGCtt GTGGTCGTGC 
'•TTCCA^TC CCACCATAAG WATCGA^- 

™~ — 

^ — — — TCCCI 

«««Tect t^.=~- CTG== , T „, 

UUMBB ™.VM=»* «XT==X«T = 

, T ^=c — — ^.n; 

r;^; 

TGTTTGTCTG TCTT^ ^ 
OX^GGC. ,C 3 ^ C^TTXOXTX ^ 

,™ — gto.tcttat — 
^=ccx,c-, „~*~ — = ! ^ 

C^V^C- CTTT««~ =— ' — G ^ 

CIXCXTCTCX GG^GTGXC- «C««CX« 
TGGGAATA.GA G^-xtf. CT~ -.—rr^V} 

—~--rr> rsxtrcMS wctaixtot gt^t.^g 

—~r T CW3CTCWCT5 T«wT*--C — 
CTA6TTCCTT TTT-A-- * -1 —-rr- ■«* 

CC^bWCXTT C ACATGT CT" GAAT.TC— 

TCAATTTCAA ?.GTTT--. ' 

c „ „~ — ; 
— — ~ : „™ 

™ = = ; 

- G -r-GGGTGGAA CAC-^--- 

,«xttktct «to«t=». - - " „mi 
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TC*K*=«a XCCCTSCXG* CTACTTTGCT =CX===X».CX XX=XCX=«= 
WKKOT CCX».C=TTO ««*«»<>■«««»» 

M «^ G c x»to««o ««««« xxcxccxc — — 

TCITTCxi XXXCXCXXCC XXSTXXCCXX XCXXTCXXC* 
™«*T«0 XXTCXXOXXS 

«««t«= ot«»t»« cxxsxxcaxx tctacc^cc 

MCMTW™ CXXCCSCXTX C=,T====« CTXOXX^. 
CT,C,TTTT,. X=~=X=C=T OCXCCXCXCC »»» 

, XC==C= M XX TGSXSXOCXX «T« A TT»* CCCOSXC^XC ^TXSGXG 

cxccxrrxcx xscxcxxcc* xxxccxsxxc «===cxxxx cxcx=cx W . 
a««~«o xr»=™c ^ ««««« 

X=,XC,C-C CZTXCSCXCXG »»«»». XX*XXCX*=X «*X«=**X 

tc~.t»c» xxgcxxxxxt xxcccxscx* ct»c«t« 
^cxccxxx xoxxxcx^ xoccrax =™c~ .x=,x^xx 
c^xxcxx ,=c=xxx=ct cx=x=*x=*c -xxxcxx* =cxx=c*c W 

__-t tocmhc r.TXQW>.XTXX GCXXXGSX"* 
7AACS.TXTGX CATBTTOT-T lTT~AI»u 

««»C XXCCXXCCXX CCXXXCXXX, e=»TT«C. XX.XXCXCC. 

.cxxx=«xx xxcxxaxxx, tccx^ cxcxxxxxxx xcxxcxxxxx 
x,,^cxc= xx==*xx M x x==xcx*xxx .xcc^cx xcctx^x 

, =AiCTTTM XXXCXXC^ »T»«« C^CCCC »«T«tX 

x*xxxxcx=x xc^ccxxc c«k»»»k 
-xcxxxcxx 

„««c» =cxx™ ««««x — - — "° 

„ tsOT «X ,=X,TC^. ««,X,XCX XXX=3^= 

;..,x,=c, 3? . ccrococcx x=x^,xx, xxxxxxxxsx xxxxccsx, 
^,x,x, t«x *~» x 
xx,x,x=xxc «™«^ cx=— ,=xcxc»^ ««*««= 
ccxgxc— — ~» ««= — ; 

xscx^xxc x^cxx^ 
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CTXTAGXTCT XCTXC.TTXT GTTOCTCeXA XGXTSXTGCC GCGTGGTGCC 
GCTCXTTTCT CTTXTGOXXT TGCTGXTGXT TTGOXTGXC* KAXGTXTCX 
XGXTCCCXA* TXCTGCTCA* XTCCGTTXGA GTGXTTTCi* 
OXSTCCXXCr OTMCCTXCC. :CC^ TTXTC*OTCT TTTTGTTXTC . 
GGYCTTGTTG OXTXtCWTTT rCXGCTC^ GCTT A C^= CTG^CTG, 
GCCTTTCTCX XXXXGGCTTG CtOU^T. TTG A G=XG=X ^GTTGXTX 
C ATG TG XCT C XTGCTTXTXX *TCCTCC=T, XGGTTXGTTC TO C««TCX 
OXTTXCCOXX ^CTCCTOXG XTaOXXXTeT XCTCXT»TX CGGAGTGGGG 
XTACCXXCOO COTATXCXX* CTTXXCCXGT CTCCCGXCXG 

TTGCA.TCCCC TTTCAG/VTXT T«cn«C ^ OXCOW^TX 
GCTGTCTGXX XOCXGO*** TXC*X,«ao XWCOOXTOX XXCXGTXCCC 
OTCCTXXflTC, CCOOOTXCXT OXKaCAXXX GCGXGGCGXG 
XTTCAXCCCT TCCGOXXTCX AfiXCTTXTXT XXTCXCTC.C 
CGCCGGCTX, CCTGTTGGXX GGGCGCGGGA CGCXGXGTGG XGCCCXXGTX 

b «xtcxt« cxxxcrrrac tnaxTcaxx oxtxtcxtox gccxtgccgc 

CCGXGGTXXC aOGTCtGATX CCGGTCCXC TCTOOCX^T 

T-rtAXTGGTC GGXGCGTXT7 GXCCTGXAGC TGTGX 
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<210> 5 lr 
<211> 402 
<212> CDMA 

<213> Krabidopsis chaliaas. 

<*20> 

<221> CDS 

■<222> U20> . . (402) 
<400> S 



11 S> 
157 



-» w w — 

MWM^CtCTTTCTCrcr =T==».=T=ATCT^C^TCC CT^TCTCTSTTCT^TTC 
CK^TTiaC^ TCCOTMMCTTWCIGCt „ ttm C«iT5»a»T 

55 E 51; SI S & S= £ SS £ SS 5S S SS 

ss s si s: ;i? s ~i s: ss s s: e - s k 315 

**r- vt** Trn XXC GGK GGA GGG AAG TGG 2 53 

it. m m is ss si £ ^ s £ aw "v oly «, 

T c= t=c m o,r xcr tst.tc wo ttc „t « tjt =t= t=t „» .== • ,n 

Sci Cy= lie Asp ssr Cys Cys Tr? Pne my - 

>>r GCA p.TG CCT GCG ACC TTC CCT 

£ K S SS S3 SS - «• ~ 

e» T» o» «o »» cc= jvr eje «. «c ttt jce «x ccc 3 

■ Glr. Tyr Vsl Thr Glu Pro As- .-lis . .r 



2S9 
402 
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48 



36 



<211> 643 
<212> cDMA 
<213=» Zea mays 
<220> 
<221> CDS 
<222> (1) . . (402) 
<4Q0> 6 

CGG GAG AA* ATA GCT GCT TTG >-^G GGG GGT GTT TAC TTA GCC GAT GGT 
Ar= Glu lie Ala Ale. Leu Lys Gly Gly Val Tyr -Leu Ala Asp Gly 

Is' 10 . 15 

GAT GAA ACT GTT CCA GTT CTT AGT GCC GGC TAC ATG TGT GCG AAA GGA 
Ac-d Glu Thr Val Pro Val Leu Ser Ala Gly Tyr Me: Cys Ala Lys Gly 
20 25 30 

TGG CGT GGC AAA ACT CCT TTC AGC CCT GCC GGC AGC AAG ACT TAC GTG 
Tro Arc Glv Lvs Thr Arc Fr.e Ser Pre Ala Gly Ser Lys Thr Tyr Val 
35 * 40 45 

AG A GAA TAC AGC CAT TCG CCA. CCC TCT ACT CTC CTG GAA GGC AGG GGC 
*r= Glu Tyr Ser His £cr Pro Pro Ser Thr Leu Leu Glu Gly Arg Gly 
£0 S5 « 

>ff CAG AGC GGT GCA CAT GTT G AT ' ATA ATG GGG AAC TTT GCT CTA ATT 
Th~ G^n Se>* Gly Ala His Val As? He Mec Gly Am Phc Ala Leu lie 
S5 " 70 75 SO 

CAG GAC GTC ATC AGA ATA GCT GCT GGG GCA ACC GGT GAG GAA ATT GGT 
Glu A=d val Tie Arg He Ala Ala Gly Ala Thr Gly Glu Glu Tie Gly 

E5 SO 

GC^ GAT CAG GTT TAT TCA GAT ATA TTC AAG TGG TCA GAG AAA ATC AAA 
Glv >« n Gin Val Tyr Ser As? He ?h£ Lys Tr? Ser Glu Lys He Ly= 
100 10= 110 

TTC AAA TTG TAA C CT AT GGG A A GTTAAAGAAG TGCCGACCCO TTTATTGCGTTCC 

Leu Lys Leu 
115 

AAAGTGTCCT GCCTGAGTGC AACT CTG GAT TTTGCTTAAA TATTGTAATT TTTCACGC 44S 

TTCATTCGTC CCTTTGTCAA ATTTACATTT GACAGGACGC CAATGCGATA CG ATG TTG 5 07 

TACCGCTATT TTCAGCATTG. TATATTAAAC TCTACAGGTG T AAGTTG CAT TTGCCAGC ScS 

TG AAATT GTG TAGTCGTT TT ' CTTTACGAT7 TAATANCAAG TGG CGG AGC A GTGCCCCA 523 



152 



210 



2SS 



i3G 



2 SI 



AC^TAAAAAA A AAAAAAAA A 



543 
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<211> 115 
<211> PRT 
<?.!!> 2ei mays 

<400> 7 



xr 3 Glu Lys XI. Ma Ala Leu Lys Gly Gly Val Tyr Leu Al. Ar? Gly 

A.p Glu Thr val Pro v.l Leu £ ,r Ala Gly Tyr Met cys Ala Ly= Gly 

20 2j 

Trp Arg Cly Ly= T'nr Ar 5 PKe Ser Pro Ala Gly Ser Ly= Thr Tyr val 

s ^ ^ 

Arg Glu Tyr Ser Ki. S.r Pro Pro Ser Thr Lou !,.« Glu Gly Arg Gly 



50 53 



Thr Gin Ser Gly Ala Kis Val Arp lie 



He: Gly Asa x — 



€5 



70 



75 



80 

Glu Val XI. ATS He Ala Ala Cly Ala T>.r Gly C-lu Glu 11= Gly 

es 5 

_ r t-t- s»- Glu Lys lie Lys 
Gly Aso Gin val Tyr Ser As? XI. Phe Ly S — o r 

100 10:s 



Lsu Lys Leu 
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<210> 
<2L1> 616 
<212> cDNA 

<213> Ksurespora crasaa 
<400> S 

GGTGGCGAAG ACCAilGGCGG AAGTTGGAGG 
AGATGGATCT ACCCTCTAGA GACACCACTA 
CTNTACNGTT TNTATGGGTA GGAAGCCGAC 
TGCCGCCCGA TCCCGGGACG ACAACGCATC 
ACTTTGACTN AGGGCCACA7 TGACCACGGT 

TGGCACXGTG aaccttatga gtttggggta 

AAATG AAG AG ATACAATCCT GCGGGCTCAA 
CCGCATGAAC CAGAACGGTT CAATCCCAGA 
CTTAA.ft.TATG TAGAAAAGGT TGAAATTTAT 
ACATAGGTTA CTCAATAGTA T G ACT AATT A 
AAAAAAAAAA AAAAAA 



CTAACGAGAA 


TGACNCTCCG 


50 


CCN**GCACC 


CAGCCTCAAG 


ioo 


GGAGCGAGCC 


TACATCTATC 


150 


TTTAGATGAC 


GATCGATACG 


200 


GTGATTTTGG 


GCGAAGGCGA 


250 


CCTCTGCAAT 


AAGGGGTGGA 


300 


AAATAACCGT 


GGTCGAGATG 


350 


GGAGGGCCGA 


ATACGGCGGA 


500 


GXAGAGTAAT 


TAAATACGGC 


550 


^AAAAAATT 


TTTTTT CT AA 


600 






£16 
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<210> 9jLr 

<5.11> 1SG2 

<212> genomic DMA 

<213> Arabidopsis thaliana 

<400> 9 



50 



ATGAAAAAAA TATCTTCACA TTATTCGGTA GTCATAGCGA TACTCGTTGT. 
GGTGACGATG ACCTCGATGT GTCAAGCTGT GGGTAGCAAC GTGTACCCTT 10 0 

TGATTCTGGT TCCAGGAAAC GGAGGTAACC AGCTAGAGGT ACGGCTGSAC ISO 
AGAGAATACA AGCCAAGTAG TGTCTGGTGT AG CAGCTGGT TAT AT CCG AT 20 0 

T CATAAG AAG AGTGGTGGAT GGTTTAGGCT ATGGTTCGAT GCAGCAGTGT 
TATTSTCTCC CTTCACCAGG TGCTTCAGCG AT CG AATG AT GTTGTACTAT 

GACcrrrBvrr tgoatgatta ccaaaatgct cctggtgtcc aaacccggot 

TCCTCATTTC GGTTCGACCA AATCACTTCT ATACCTCGAC CCTCGTCTCC 
GGTTAGTACT TT C C AAG AT A TATCATTTTG GGACATTTGC ATAATGAACA 
AAATAGACAT AAATTTGGGG GATTATTGTT ATATCAATAT CCATTTATAT 
GCTAGTCGGT AATG TGAGTG TTATGTTAGT ATAGTTAATG TGAGTGTTAT 
GTGATTTTCC ATTTTAAATG AAGCTAGAAA GTTGTCGTTT AATAATGTTG 
CTATGTCATG AGAATTATAA GGACACTATG TAAATGTAGC TTAATAATAA 
GGTTTGATTT GCAC-AGATGC C ACAT CTTAC ATGGAACATT TGSTWLAACC 
TCTAGAGAAA AAATG CGGGT ATGTTAACGA CCAAACCATC CTAGGAGCTC 
CATATGATTT CAGGTACGGC CTGGCTGCTT CGGGCCACCC GTCCCGTGTA 
GCCTCACAGT TCCTACAAGA CCT CAAACAA TTGGTGGAAA AAACTAGCAG 
CGAGAACGAA GGAAAGC CAG TGATACTCCT CTC CC ATAGC CTAGGAGG AC 
TTTTCGTCCT CCATTTCCTC AACCGTACCA CCCCTTCATG GCGCCGCAAG 
TACATCAAAC ACTTTGTTGC ACTCGCTGCG CCATGGGGTG GGACGATCTC 
TCAC-ATGAAG ACATTTGCTT CTGGCAACAC ACTCGGTGTC CCTTTAGTTA 
ACCCTTTGCT GGT CAGACGG CATCAGAGGA CCTCCGAGAG TAACCAATGG 
CTACTTCCAT CTACCAAAGT GTTTCACSAC AGAACTAAAC CGCTTGTCGT 
AACT CC C CAG GTTAACTACA CAGCTTACGA GATGGATCGG TTTTTTGCAG 
AAT7GGAT7 CTCACAAGC-A GTTGTGCCT7 ACAAGACAAG AGTGTTGCCT 

SUBSTITUTE SHEET (RULE 26) 
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TTAACAGAGG AGGTGATGAC TCCGGGAGTG CCAGTCACTT GCATATATGG 
GAGAGGAGTT GATACACCGG AGGTTTTGAX GTATGGAAAA GGAGGATTCG 

ATAAGCAA.CC AGAG ATT AAG TATGGAGATG GAGATGGGAC GGTT AATTT G 

GCGAGCTTAG CAGCTTTGAA AGTCGATAGC TTGAACACCG TAGAGATTGA 

TGGAGTTTCG CATACATCTA TACTTAAAGA CGAGATCGCA CTTAAAGAGA 

TTATGAAGCA G ATTT C AATT ATTAATTATG AATTAGCCAA TGTTAATGCC . 
GTCAATGAAT GA 
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<210a 10 \y 

<?.11>3S9S 

<2 12> cycr.on-.ic d:;a 

<213> Arabiciopsir thaliana 

<400> 10 

ATGGGACCGA ATTCGAAATC AGTAACGG CT TCCTTCACCG TCATCGCCG7 5 0 

TTTTTTCTTG ATTTGCGGTG GCCGAACTGC GGTGGAGGAT GAGACCGAGT 10 0 

TTCACGGCGA CT ACT CG AAG CT^TraC^T-B. TXATCATXCC ■ 1=0 

TCGACGCAGC TACGAGCGTG GTCGATCCTT GACTGTCCAT ACACTCCGTT 200 

GG ACTTC AAT CCGCTCGACC TCGTATGGCT AG AC A CC ACT AAGGTCCGTG 25 0 

ATCTTCATTT CCTTCGCTCC TTATTCTGTC GGTCGAGTCA CTTCiTGATG 3 00 

AVTTCCAAGC G AAAT AT AG C AATGAAGCA-T GTCTCGTCTC TCTTATTGAT 3 50 

T CG TT C ATT A GTCAACAGTG ACGCTTCTGA ATCTGAGTTT AG AG TC AT AT 40 0 

AAAAC AG CTO ACTCGGCGAG TCTTTCCCAT CGCTTTTGGT TCGCTAAATG 450 

TAGCGCAATG AATGTGTAAT TACTCTGCGC TTTTTATTCA ACTAGATCTG 500 

CAAGTTTTTC A.CAGTGCTCA ATAGTAGTTA GAAAATGTTA GGTCATTTTA 55 0 

CTTGTGCATT GTGATTCTTT TGGTTGTTGC TTACTGATCG ACGTGATGGA 600 

TGGTTTACAG CTTCTTT CTG CTGTCAACTG CTGGTTTAAG TGTATGGTGC £5 0 

TAGATCCTTA TAATCAAACA GACCATCCCG AGTGTAAGTC ACGGCCTGAC 70 0 

AGTGGTCTTT CAGCCATCAC AGAATTGGAT CCAGGTTACA TAACAGGTAG 7S0 

TTTCGGATTT TTCTTTCTTT TGAGTTTTCT TCAATTTGAT ATCATCTTGT 8 00 

TCTGATATAA TATGGCTAAG TTCATTAATT TGGTCAATT T TCACGTCCTC eSO 

TTTCTACTGT CTGGAAAGAG TCGCTTAAGT GGTGTGTTGA GTTTGGTATA 90 0 

GAACCAAATG CAATTGTCGC .TGTTCCATAC GATTGGAGAT TGTCACCAAC BSD 

CAWTGGAA GAGCGTGACC TTTACTTTCA CAAGCTCAAG TTAGTCCTTA 10 0 0 

TCAGGCTAAT GTCTTTXA.TC TTCTCTTTTT ATGTAAGATA AGCxAAGAGC 105 0 

TCTGCTCGTC TTCCTTTTTG CAGGTTGACC TTTGAAACTG CTTTAAAACT 110 0 

CCSTGGCGGC CCTTCTATAG TATTTGCCCA TTCAATGGGT AATAATGTCT 115 0 

TCAGATACTT TCTGG AATGG CTGAGGCTAG AAATTGCACC AAAACATTAT 12 0 0 

TTGAAGTGGC TT GAT C AG CA TATCCATGCT TATTTCGCTG TTGGTACCGG X2 5 0 

CCrACTATCC TTAAGTTACC ATTTTATTTT TTCTCTAATT GGGGGAGTTA 13 0 0 

TGTTGTGACT TACTGGATTG AGCTCGATAC CTG ATTTGTT GTTGATTTAG 12 SO 
GAGCTCCTCT TCTTGGTTCT GTTGAGGCAA TCAAATCTAC TCTCTCTGGT 
GTAACGTTTG GCCTTCCTGT TTCTGAGGTG AC CT CTG ACT T CTC TTT AGx 
TTTAAHTir^T TGATATCAAC d\CCTCTTAT MCTCACTGG ATTTTCCTTT 

TGAAAGTATT ACTTTTGTTA ATTGAACTGC TGT ACGCG A i ATGGaATCTG IS SO 

TAGATCTTGA AGTGCTAGTT ATCAAAGAAC ATATTGTGGG TAGTATACCT 1500 

GTCAGCGGCC TTAGCTAATA CAACCAAACC ACATGTACAC TGATTTAGTT 165 0 

TT C AG ATT AT TATGGTAGAC TTTAAGTTGA CAAGAAACTT TGACTGAAAT 17 0 0 

CTTTTTATTT TAATAGGCTA TGATTTGTTT ATTGAAATCA TGTGACATAT 17 SO 

TGACATGCGC TT CTCATGTT TTTTGTT GGC AAGGCTTCAG GGAACTGCTC 16 0 0 

GGTTGTTGTC CAATTCTTTT GCGTCGTCAT TGTGGCTTAT GCCATTTTCA x35 0 

AAG AATTG C A AGGGTGATAA CACATTCTGG ACGCATTTTT CTGGGGGTGC 19 0 0 

TG CAAAG A A.- GATAAGCGCG TATACCACTG TGATGAAGAG G AAT AT C AA7 155 0 

CAAAATATTC TGGCTGGCCG AC AAAT ATT A TTAACATTGA AATTC CTTCC 2 00 

ACTAGCGGTT AGACTCTGTA TATGCAACTG TAACACTAAC AAAAGTTTCA 2 050 

CCAAGAATGT TCACTCTCAT ATTTCGTTCC TTTGATGTGT ATCCATCAGT 210 0 

TACAGAAACA GCTCTAGTCA ACATGACCAG CATGGAATGT GGCCTTCCCA 21a0 

CCCTTTTGTC TTTCACAGCC CGTGAACTAG CAGATCGGAC TCTTTTCAAA 22 00 

G CAAT AG AAG ACTATGACCC AGATAGCAAG ACGATGTTAC AC C AGTT AAA 22 5 0 

GAAGTACGTA CCTTTCTTTG T G AT AAGAAA TATTGCTCAT CGATCATCAC 2j00 

T7CCT GGCTT CTTGTACGTC AAATTG T TTT GTTTAAATCT CTATATCAAT 23 5 0 
TG7TCATATG CTTTGT CTTT CTTACTATAA GAAACAAGTA 

CCTTATTATT GATTATCAGT TCTCTCCTTA '7ATTATGGAA TGTCT-^-iC 2.3 0 

■ G TTT AC AGTT ATGA-ATGCAA AACGGGGTAT TTTAGTTGAT TGATTCTCTw 2*0.; 
ATT CTCTAGT TTGTTTTGAC T AATAG CG T C AATTTTGTTT 

7CTTTGTCAA. TTA.TATATAA CATGCTAACT ATACTTTTCA MTT«i~TvA 2 = 00 

TGATGACCCT GTTTTTAATC CTCTGACTCC TTGGGAGAGA CCA^CTAT-A . * „ 0 

AAJ^TG-ATT ^TGCATATAT GGTGCTCATC TAAAGACAGA GGTAT^ATGC . 2/00 

A7TCTCAATA TCACATTATG CGTTGACTTT G TT ATT AT AT TCCCCA^TC- 27^0 
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1150 
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CTTTCCAATA TCTTTTTCAA T^^IIT* IXaGCTCTCT GTCATAGGTT 
CCTA7TAACC GTTAAAGGTA TATCCTGATA ATTGGATCAT 

OGTTArTACT TTGCCCCAAG ^CAM^CT JSIgTGTCA AGGTAATTTT 
CMSOWMC ATTTATCAAA S£ TC TTCTO TATCAGTCTA 

CCCCAATGGC ACAAGTAAAA ^ATTAAACA ACTAAAATTT 

GTGGCATGTT ATCTCAGTTG CATAASwAA* T CAGTGCCTTA 

^GTACTTTT TTATCA.TTCC.TTTTGAG ; -TT tc ^GATAACT 

AAGTGGGAAG AGGTGTTGCA TGAAACA.GA CAJT aggAGTAGTC 
AGCAAAACAA AACTAACCCA TTTCTW^* SSJS^o TTCATATCTT 
GTGCTTTTAA AAAATTIGT? 3^*J^| TGGTTGATGG GAACOCXKA 
GATTGTGCAA TATCTGCAGG GAAGTTGGTT TTGAAATTAT 

CCIATAACTG GGGATGAGAC C «^=i?t ??£?TGCTTC TCGAACACTG 

cttcvtgcaa actactgaag ^ctaagataa ccg ctacttttat 

CTTGCTATG? TCTCTAGTAC ACTG^AAXAT T^AC -j-xcA CTCTCTTGGT 
TGATT ATG A A ATTG AT CT CT TATAGGTACC ^^TGGC TCCCCAGGTA 
GCAAGAATTG GCTCGGACCT ^^fj S^SxTTAA GTGTTACTTTT 
CTCTTTTTTA GTTCCTCACC TTATATAGA.T EZZl^cTAA AAATCATATA 
CTGGTTATGT GTTGATTTAC CTCCAAxTTG TTCT^ tTCTCATTCG 
TCf CTGTACT CCTCAAGAAC SgCGACGTA CATGTGGAAC 

GAAAATAAAA CAACAGCCAG TCATAGCTAA CATGACAAAA 

TAAATGTTGA TCATGAGCAT gSgACTCTG AGAGCA7TCC 28 = 0 

GCACCAAGGG TTAAGTACAT °^GXGGG TATTAA 3 8 9S 

GGGGAAGAGA ACCOCAGTCT C3GAGCTTGA 
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<2X0> lib* 
<211> 703 
<212> CDMA 
<2I2=» camaco 
<400> IX 

CTGGGGCCAA AAGTGAACAT AACAAGGACA 
TCAGATGTAC AAGTCCATCT AAATATAGAG 
CATTCCCAAT ATGACAAAGT TACCTACAAT 
AGSATTCTCA AAGTTTTCCA GGGACAAGAA 
AAAGCAAATC ACAGGAACAT TGTC AG AT CT 
GTGGCTTGAG ATCTGGCATG AT ATT CAT CC 
TTAC AAAAG G TGGTGTCTGA TCCTCACTAT 
GTTTGTATTG ACATTGTAAG' TATTGCAACA 
TGAGGGATCA GGACTGCTAT TGGGATTACG 
G CTG AAC ATT GTGAATACAG GTTACAATAT 
TATTCTCTTT TTGTGTATTT AGGCCACCTT 
GATATGTATT CGGGGATGTT CACCTGGGAC 
TCTACATCTC ACATCCTGTC ACACTATGTG 
TTGGCGGAAC AACAAGTTTG CACAAACATT 
TTCAGAGAG 



CCACAGTCAG AGCATGATGT SO 

CATCAACATG GTGAAGATAT. 10 0 

GAAGTACATA AC CTATTATG 15 0 

C AG C AGTTTG GGAGCTTGAT 200 

CCAGCTTTGA TGCGGGAGCT 25 0 

TGATAAAAAG TCCAAGTTTG 3 00 

TTTCTTCTAT AAATGTTTGA 3 SO 

AAAAGCAAAG CGTGGGCCTC 40 0 

GGAAAGCTCG ATGTGCATCG 4 SO 

TCAAATTATA TTTTG CAAAA 5 00 

TCCCCCGTCA CAACGATGCA 55 0 

AGAGTTGCAG ATTGAAGAGT £0 0 

TGATATTTAA GAAACTTTGT £5 0 

TGAAGAAGAA AGCGAAATGA 700 

709 
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